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FOREWORD 


The  generation,  validation,  and  correlation  of  numerical  data  is  widely  recog¬ 
nized  as  a  centrally  important  aspect  of  scientific  and  technical  activity.  The 
technical  community  exploring  and  exploiting  chemical  knowledge  has  taken  a 
leading  position  in  developing  the  collation,  evaluation,  correlation  and  extrap¬ 
olation  techniques  required  for  use  of  large  sets  of  interrelated  numerical  data. 
The  content  and  format  of  these  chemical  data*  compilations  reflect  a  consid¬ 
ered  effort  to  meet  the  information  communication  requirements  of  specific 
user  populations. 

The  complexity  of  the  present  chemical  information-communication  matrix 
is  difficult  to  describe  analytically.  Projection  of  a  more  effective  future  sys¬ 
tem  from  this  non -quantitative  reference  base  is  obviously  even  more  difficult. 
However,  numerical  chemical  data  compilations  existing  in  the  matrix  are  ame¬ 
nable  to  systematic  characterization.  Thus,  they  can  facilitate  a  more  quanti¬ 
tative  definition  of  existing  chemical  data  communication  systems  and,  subse¬ 
quently,  criteria  for  improved  systems. 

Data  compilations  are  technically  and  economically  viable  in  actual  chemical 
communication  situations.  It  appears  axiomatic  that  the  chemical  data  and  in¬ 
formation  systems  of  the  future  must  be  based  on  an  awareness  of  the  capabi¬ 
lities  and  limitations  of  present  practices  and  resources. 

It  is  widely  recognized  that  the  availability  of  high  capacity  automatic  data 
processing  and  transmission  equipment  has  opened  new  opportunities  for  the 
construction  and  integration  of  files  of  data  on  chemical  substances.  Currently, 
large  scale  systems  are  under  consideration  for  handling  chemical  information 
and  data.  These  include  the  Chemical  Information  Program  coordinated  by  the 
National  Science  Foundation,  the  Chemical  Information  and  Data  System  develop¬ 
ment  program  of  ti.  .•  Department  of  the  Army,  and  the  National  Standard  Ref¬ 
erence  Data  System  directed  by  the  National  Bureau  of  Standards.  Each  of  these 
programs  includes  efforts  toward  the  development  of  methods  of  analyzing  data 
files  and  data  flows  to  obtain  information  needed  to  plan  improved  systems. 


*  In  the  context  of  this  report,  chemical  data  is  defined  as  quantitative 
information  concerning  the  physical  and  chemical  properties  of  chemically 
* idhentifiable  substances  or  systems. 
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Chemical  data  handling  by  automated  equipment,  to  date,  has  paralleled  early 
business  data  handling  in  that  it  has  been  pursued  in  a  fragmented  manner. 

During  the  early  phases  of  applying  automated  equipment  to  the  handling  of 
business  data,  separate  files  were  often  maintained  to  service  a  limited  set 
of  user  requirements.  Other  files  containing  much  of  the  same  data  were  main¬ 
tained  to  serve  other  user  requirements,  resulting  in  fragmentation  of  the  total 
data  file.  More  recently,  integrated  data  processing,  which  combines  files, 
eliminates  or  reduces  duplication  of  data  items,  and  serves  ail  users  from  fewer 
data  files,  has  pro"*  «  more  efficient  in  business  data  processing.  Conditions 
contributing  to  this  ;  aige  in  business  data  processing  included: 

•  Devekipr  Knt  of  computers  with  larger  storage  memories  and 
faster  ,  recessing  capabilities,  and 

•  Development  of  methods  of  analyzing  data  files  and  data  flows 
to  obtain  information  needed  to  plan  improved  systems. 

The  applicability  of  integrated  data  processing  techniques  to  chemical  data 
handling  and  user  servicing  is  largely  an  unresolved  question,  A  major  objective 
of  NSF  Contract  C-478  is  the  development  and  application  of  a  methodology  for 
analysis  and  characterization  of  Chemical  data  compilation  activities.  The  basic 
analysis  technique  which  we  tested  was  the  examination  of  each  data  compilation 
as  a  data  file  and  its  substance  and  property  coverage.  However,  because  of 
the  complex  interactions  which  exist  between  a  data  compilation  activity  and  the 
environment  on  which  it  is  operated  and  die  technical  activity  which  it  supports, 
our  study  also  examined  operating  statistics,  operating  motives,  users  served 
and  other  factors  that  might  shed  light  on  basic  design  factors. 


I.  Introduction 

The  National  Science  Foundation,  which  is  sponsoring  the  National  Chemical 
Information  Program,  contracted  with  Science  Communication,  Inc.  ,  under 
contract  number  NSF  C-478,  for  assistance  in  analyzing  existing  compilations 
of  data  on  the  physical  and  chemical  properties  of  chemical  substances  and 
systems.  Our  work  scope  was  initiated  as  a  study  of  the  structure  of  existing 
chemical  data  compilations,  but  due  to  new  developments  in  the  National  Chem¬ 
ical  Information  Program,  the  emphasis  was  shifted  towards  obtaining  a  greater 
amount  of  information  relevant  to  the  design  elements  associated  with  the  actual 
development  of  a  national  chemical  information  system.  Our  activities  con¬ 
sequently,  were  phased  with  those  of  the  Chemical  Abstracts  Service  in  the  de¬ 
velopment  of  a  chemical  compound  registry  system,  and  with  system  pre-design 
studies  currently  under  way  for  the  National  Science  Foundation  by  Information 
Management,  Inc. 
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Prior  to  discussion  of  the  findings,  it  would  be  desirable  +o  place  cur  study  ef¬ 
fort  in  its  context  among  the  several  existing  modes  of  communication  of  chem¬ 
ical  data  from  generator  to  user, 

A  community  of  interest  exists  when  individuals  and/or  organizations  identify 
themselves  with  a  common  goal.  In  scientific  and  technical  endeavors,  such 
goals  are  often  referred  to  as  missions.  In  this  context,  missions  are  not 
necessarily  applications -oriented.  One  norma1  manifests tion  cf  a  community 
of  interest  is  the  development  of  an  effort  lo  generalize,  organize  and  conserve 
the  data  required  to  pursue  the  mission.  This  effort  may  be  well  -articulated 
and  formally  structured,  or  it  may  be  informally  structured  and  hardly  discer¬ 
nible.  Various  communities  of  interest  have  developed  chemical  data  communi 
cation  systems  which  operate  in  one  cr  more  of  the  following  linkages: 

Generator  to  user 

Generator  to  document  publisher  to  user 

Generator  to  document  publisher  1o  document  processor  to  user 
Generator  to  document  publisher  to  document  proc  essor  to  data 
processor  to  user 

Generator  to  data  processor  to  user 

Generator  to  data  processor  to  document  processor  to  user 

Our  study  effort  was  restricted  to  the  consideration  of  the  data  processor  ele¬ 
ment  of  the  system  and  the  interfaces  between  the  data  processor  and  the  ele¬ 
ments  which  provide  input  or  receive  output.  Input  and  output  elements  for 
the  data  processor  elements  depend  on  the  operating  mode  cf  a  system.  In 
many  systems,  a  data  processor  (or,  in  our  project  terminology,  a  data  com¬ 
pilation  activity)  may  operate  simultaneously  as  an  element  of  mere  than  one 
system.  That  is,  a  data  compilation  activity  often  performs  several  commu¬ 
nication  functions  simultaneously. 

II.  Study  Procedure 

We  have  studied  a  significant  number  of  data  compilations  and  their-  respec¬ 
tive  centers  for  obtaining  quantitative  information  representative,  of  their  c-ur 
reit  activity.  Parameters  studied  include  those  representing  scientifically 
meaningful  parcels  of  information  as  well  as  the  fundamental  elements  of  such 
parcels.  Data  of  the  following  nature  were  collected  or  generated  on  selec  - 
ted  data  compilations: 

(1)  Coverage  of  chemical  substances  including  types  of  chemical  sub¬ 
stances  (pure  chemicals  or  mixtures,  elements  or  compounds  in¬ 
organic  or  organic  compounds,  etc.  ». 

(2)  Mode  of  representation  of  chemical  substances  <CA  name,  molec¬ 
ular  formula,  structural  formula  code  etc.). 
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(3)  Number  and  identity  of  properties  covered. 

(4)  Format  of  data  compilation. 

(5)  Parameters  or  units  of  measure  of  the  property. 

(6)  Number  of  associated  variables  in  the  file  record. 

(7)  Reference  citing. 

(8)  Extent  of  validation  or  evaluation  of  the  property  values  pre 
sented  in  the  output. 

(9)  Available  forms  of  the  compilation  (printed  handbook,  computer 
tapes,  optical  coincidence  indexes,  tabular  indexes,  cdge-notchcd 
cards,  3x5  cards,  etc.  ) 

(10)  Time  period  coverage  and  growth  ra’e  of  compilation. 

(11)  File  structures  developed  to  accommodate  the  above. 

Data  for  our  analysis  were  first  obtained  through  a  review  of  and  extraction  from 
specimen  reports  and  descriptive  literature  issued  by  da>a  compiling  activities. 
Considerable  amounts  of  time  and  effort  were  spen’  in  obtaining  this  data  from 
sources  in  the  SCI  files,  the  National  Standard  Reference  Data  Office,  the  NAS- 
NRC  Office  of  Critical  Tables,  etc. 

A  second  step  of  our  analysis  was  to  obtain,  through  interviews  with  data  com¬ 
pilation  directors  and  personnel,  verification  of  our  preliminary  findings,  oper 
ating  statistics,  communities  the  compilation  is  intended  to  serve,  current 
design  rationales,  experience  with  other  form’s,  and  information  we  had  not 
acquired  from  our  review  of  their  descriptive  literature  and  specimen  outputs. 

As  much  information  and  data  as  possible  was  extracted  and  placed  on  work 
sheet  forms  before  the  data  compilation  director  was  interviewed.  By  using 
this  approach  interviews  consisted  of  informed  questions  covering  the  internal 
and  external  operations  of  the  activity.  Interviews  were  productive,  and 
cooperation  was  excellent. 

Analysis  of  the  collected  information  began  as  soon  as  meaningful  amounts  had 
been  obtained.  In  the  course  of  the  development  of  our  information  gathering 
worksheets,  we  provided  for  the  principal  correlations  which  we  intended 
to  perform  with  the  assembled  information.  These  correlations  are  identified 
in  the  following  matrix  (X  indicates  performance  of  a  correlation;  --  indicates 
no  correlation;  the  more  important  correlations  to  be  performed  are  indicated 
by  +). 
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NOTE:  The  matrix  row  numbers  are  the  same  as  the  column  numbers. 


The  correlation  objectives  place  particular  emphasis  on: 

(1)  identification  and  characterization  of  significant  information 
used  in  chemical  data  processing, 

(2)  realistic  estimates  of  the  volumes  of  chemical  data  compiled 
to  serve  identified  user  needs, 

(3)  identification  of  substance  and  property  coverage  among 
chemical  data  compilations,  and 

(4)  the  implications  of  the  structure  and  content  of  existing  chem¬ 
ical  data  compilations  for  design  and  operation  of  future  auto¬ 
mated  chemical  data  and  information  systems. 

III.  Work  Accomplishments 

Our  initial  efforts  were  directed  toward  the  development  of  criteria  for  the 
selection  of  data  compilation  activities  to  be  analyzed.  The  basic  criteria 
established  for  the  data  compilations  studied  are  that  they  should: 
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(1)  cover  either  physical  or  chemical  properties  of 
chemically  identifiable  substances, 

(2)  cover  a  significant  segment  of  chemical  substances 
or  properties, 

(3)  serve  a  significant  user  population, 

(4)  disseminate  compilations  of  data  fo  ihe  technical 
community,  and 

(5)  have  existed  on  a  continuing  basis  for  a  period 
adequate  to  verify  operational  validity  of  use  to 

the  development  of  a  National  Chemical  Information 
System. 

The  fifth  criterion  was  a  screening  consideration  in  our  selection  of  data  com¬ 
pilation  activities  to  be  analyzed.  The  activities  chosen  for  study  fulfill  all  cri¬ 
teria  in  varying  degrees,  but  the  fifth  criterion  was  in  many  cases  a  deciding 
factor  for  or  against  inclusion. 

The  office  of  Standard  Reference  Data  of  the  National  Bureau  of  Standards  made 
available  to  us  early  in  the  project  the  preliminary  tabulation  of  results  from  the 
following  question  in  their  survey  of  the  data  use  practices  of  the  members  of  the 
American  Chemical  Society: 

"Which  data  compilations  do  you  consult,  most  often?  Please 
list  small,  highly  specialized  compilations  as  well  as  more 
extensive  compendia.  " 

As  might  be  expected,  the  response  to  this  survey  showed  that  handbooks  and 
large  compendia  such  as  Lange's  Handbook  of  Chem.stry  or  the  International 
Critical  Tables  proved  to  be  useful  to  a  significantly  large  number  of  users. 
Significant  though  such  collections  are  however,  in  the  realm  of  chemical  data 
their  modus  ope r and i  of  periodic  revision  of  an  editorial  nature  did  not  meet  the 
fourth-criterion  sufficiently  to  merit  coverage. 

Efforts  were  made  to  include  a  representative  sampling  of  each  of  the  following 
chemical  substances  and  systems: 

(1)  pure  chemical  elements, 

(2)  pure  chemical  compounds  (organic  and  inorganic),  and 

(3)  mixtures  of  compounds  and/or  elements. 

A  literature  search  was  conducted  to  identify  candidate  chemical  data  compila¬ 
tion  activities  for  analysis.  A  list  of  candidate  compilations  was  developed 
through  the  examination  of  standard  directories  to  technical  information  services 
and  systems,  and  other  local  sources. 
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Efforts  to  characterize  selected  compilations  m  a.n  analytical  fashion  re  vealed 
that  compilation  activities  which  were  initially  considered  as  a  single  compil¬ 
ation  were  in  fact  multiple  compilations  prepared  by  one  housing  institution. 

We  therefore  found  it  necessary  to  characterize  the  parameters  of  each  com  - 
pilation  where  several  compilations  were  prepared  in  one  housing  institution. 
This  provided  an  opportunity  to  look  for  parameters  of  a  compilation  ihat 
were  influenced  by  the  basic  communication  requirements,  as  distinguished 
from  those  conditioned  by  the  housing  environment  fe.g.  ,  processing  traditions, 
availability  of  special  support  facilities  etc.).  Other  influences,  such  as  the 
special  skills  of  the  center's  key  staff,  the  nature  of  the  subject  field,  sponsor 
influence  on  the  center's  mission,  and  the  limited  number  of  housing  institu¬ 
tions  in  the  sample  swamped  out  any  distinctions  that  could  be  identified  with 
housing  institutions. 

Data  compilation  activities  examined  were  in  the  following  housing  institutions: 

(1)  OaK  Ridge  National  Laboratory,  Atomic  Energy  Commission, 

Oak  Ridge,  Tennessee:  Atomic  and  Molecular  Processes  Infor¬ 
mation  Center. 

(2)  National  Bureau  of  Standards,  Boulder,  Colors  do  Cryogenic 
Data  Center. 

(3)  Electrical-Electronic  Properties  Information  Program,  Hughes 
Aircraft  Company,  Culver  City,  California:  Electronic  Prop¬ 
erties  Information  Center. 

(4)  Dow  Chemical  Company,  Midland,  Michigan:  Joint  Army-Navy- 
Air  Force  (JANAF)  Thermochemical  Tables- 

(5)  Texas  A&M  University,  College  Station,  Texas:  Thermodynamics 
Research  Center  Projects  on  Properties  of  Chemical  Compounds 
(API  Project  and  TRC,  formerly  MCA*. 

(6)  National  Bureau  of  Standards,  Gaithersburg,  Maryland:  Selected 
Values  of  Thermodynamic  Properties  of  Inorganic  Substances 
Data  Center  (NBS  Circular  500). 

(7)  University  of  California,  Berkeley,  California:  Thermodynamic 
Properties  of  Metals  and  Alloys  Data  Center. 

(8)  Purdue  University,  Lafayette,  Indiana:  Thermophysical  Proper¬ 
ties  Research  Center. 

(9)  Wyandotte  Chemical  Corporation,  Wyandotte,  Michigan:  The 

Wyandotte -ASTM  Spectral  Data  Project. 

(10)  American  Society  for  Testing  and  Materials,  Philadelphia,  Penn¬ 
sylvania:  The  Joint  Committee  on  X  Ray  Powder  Diffraction  Data. 

The  information  collected  for  analysis  was  grouped  into  eleven  categories. 

The  primary  and  secondary  means  of  acquiring  each  category  type  of  infor¬ 
mation  were  formulated  as  shown  in  the  following  table- 
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Sources  of  Information  for  Analysis 


Information 

(1)  Functional  design  objectives 

(2)  Users  served 

(3)  Substances  covered 

(4)  Properties  covered 

(5)  Data  acquisition  procedures 

(6)  File  formats  and  characteris¬ 

tics 

(7)  Output  and  service  forms 

(8)  Size  of  activity 

(9)  Distribution  of  effoit 
(10)  Source  of  Funds 
(VI)  Year  established 


Primary  Source 

Interview 
Interview 
Specimen  analysis 
Specimen  analysis 
Interview 

Specimen  analysis 


Secondary  Source 

Descriptive  Literature 
Descriptive  Literature 
Descriptive  Literature 
Descriptive  Literature 
Descriptive  Literature 

Interview 


Specimen  analysis  Interview 
Descriptive  literature  Interview 
Interview  Descriptive  Literature 

Interview  Descriptive  Literature 

Descriptive  literature  Interview 


In  accordance  with  the  above  plan,  work  sheets  were  designed  and  developed, 
and  a  file  of  descriptive  literature  for  each  compilation  to  be  studied  was  assem¬ 
bled.  The  descriptive  literature  file  was  assembled  from  the  SCI  library  and 
its  bibliographic  references,  the  Defense  Documentation  Center,  Technical  Ab 
stracts  Bulletin,  and  direct  inquiries  to  the  compilation  activities.  Annual  re 
ports,  progress  reports,  and  their  discussions  of  coverage,  processing  methods, 
operating  statistics,  and  services  rendered  proved  to  be  the  most  useful  type 
of  descriptive  literature.  Our  initial  worksheets  were  developed  for  gathering 
of  information  from  such  literature  and  from  specimen  analysis.  Information 
gathered  with  this  worksheet  was  adequate  for  a  general  characterization  of 
each  compilation  activity,  but  was  not  sufficiently  definitive  to  characterize 
quantitatively  a  specific  compilation  prepared  in  an  activity  which  produces  sev¬ 
eral  compilations  We  also  encountered  problems  resulting  from  the  hetero 
geneity  of  terminology  found  in  the  descriptions  of  compilation  activity,  and  it 
became  apparent  that  this  heterogeneity  would  at  some  time  have  to  be  resolved 
by  a  generalized  vocabulary  or  concept  of  data  compilation  operations  if  a  sys  ¬ 
tematic  characterization  and  cross-section  analysis  of  different  compilations 
were  to  be  a*,  complished.  We,  therefore,  decided  to  develop  a  generalized 
concept  of  data  compilation  operations  and  to  incorporate  it  into  the  informa¬ 
tion-gathering  worksheet,  so  that  information  would  be  uniformly  gathered  and 
the  problem  resolved  in  the  course  of  our  work  instead  of  being  an  obstruc¬ 
tion  to  its  completion. 


The  following  table  identifies  typical  processing  operat  ons  we  identified  as 
basic  to  data  compilation  activities: 


i 

I 

I 

I 

I 

I 
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Typical  Processing  Operations  in  Data  Compilation  Activities 

A.  Data  Source  Processing  Operations 

1.  Search  for  source  documents 

2.  Obtain  source  documents 

3.  Screen  source  documents  for  relevance  and  duplication 

4.  Catalog  source  documents 

a.  bibliographic  characterization 

b.  content  characterization 

5.  Process  source  documents  for  storage 

6.  Input  source  documents  to  storage 

7.  Search  source  documents 

8.  Output  references  from  search  of  source  document  file 

B.  Data  File  Construction 

1.  Assemble  appropriate  source  documents 

2.  Extract  data  and  appropriate  information 

3.  Screen  and  evaluate  extracted  data  and  information 

4.  Process  data  and  information  for  storage 

C.  Output  Operations 

1.  Interpret  data  inquiry  for  publication  requirement  and 
formulate  appropriate  search  strategy 

2.  Search  and  retrieve  data  and  information 

3.  Edit  search  output 

4.  Disseminate  appropriate  data  and  information 

Generalized  flow  charts  of  these  processes  were  developed  (figures  1,2,3),  and 
our  information-gathering  worksheet  was  redesigned  to  gather  information  in 
reference  to  this  conceptualization  of  the  data  compilation  process. 

Experience  proved  that  the  flow  charts  and  operations,  with  minor  changes, 
satisfactorily  covered  the  principal  operations  of  data  compilation  activities 
of  interest  to  the  study.  By  incorporating  this  concept  into  the  worksheet, 
information  given  on  an  individual  level  in  progress  and  technical  reports  of 
the  data  centers  was  reduced  to  a  uniform  level.  The  concept  proved  satisfac¬ 
tory  in  interviews  with  the  compilation  directors.  It  eliminated  confusion  and 
aided  communication  between  the  interviewer  and  interviewee.  Not  all  of  the 
operations  in  the  conceptual  model  were  performed  by  all  of  the  data  compi¬ 
lation  activities  examined.  Some  of  the  operations  and  services  were  omitted 
entirely  by  some  activities,  performed  simultaneously  in  a  single  operation, 
or  were  far  more  important  in  one  compilation  activity  than  in  another.  This 
variety  was  expected.  The  conceptualization,  however,  served  its  purpose  in 
reducing  such  variety  from  descriptive  paragraphs  to  specific  terms  and  oper 
ational  elements. 


i  . 


# 


8ol«no«  Communication 

Washington,  D.  C.  200  07 

NSF  C-478  Final  Report  -11- 

20  March  1967 


n 

W 

c£ 

D 

a 

[a 


c 

o 


<0 

d 

o 

O 

v 

3 

P‘4 

rt 

ts 

O 


•ol»no«  Communication 

Washington,  D,  C.  200  07 

NSF  C-478  Final  Report 
20  March  1967 


toitnot  Communion  Hon 

Washington,  O.  C.  200  07 

NSF  C-478  Final  Report  -13- 

20  March  1967 


The  table  given  below  is  a  coarse  description  that  highlights  the  principal 
operational  file  elements  of  the  compilation  activities  studied. 


Center 

Code 

Center  Name 

Data  Source 
File 

Data  Source  & 
Data  File 
Combination 

Data  File 

EPIC 

Electronic  Properties  Infor¬ 
mation  Center 

X 

-- 

-- 

TPRC 

Thermophysical  Properties 
Research  Center 

X 

UC 

Thermodynamic  Properties 
of  Metals  and  Alloys  Data 
Center,  U.  Calif. ,  Berkeley 

X 

CDC 

Cryogenic  Data  Center 

X 

-- 

— 

AMPIC 

Atomic  &  Molecular  Pro¬ 
cesses  Information  Center 

X 

— 

a  Sum 

Thermodynamics  Research 
Center,  Texas  A&M 

-- 

X 

NBS 

Selected  Values  of  Thermo¬ 
dynamic  Properties  of  In¬ 
organic  Substances  (NBS 
Circular  500) 

X 

JANAF 

Joint  Army-Navy-Air  Force 
Thermochemical  Tables 

-- 

X 

X 

ASTM 

Wyandotte -ASTM  Spectral  Data 
Project 

— 

X 

X-Ray 

Joint  Committee  on  X-Ray 
Powder  Diffraction  Data 

-- 

-- 

X 

The  information-gathering  worksheet  was  designed  to  acquire  information  on 
the  technique  and  rationale  of  each  compilation  activity  for  each  of  the  flow 
charted  operations  that  it  actually  performs.  The  final  version  of  the  work¬ 
sheet  was  highly  structured  (see  Appendix  D)  and  was  used  for  acquisition  of 
information  from  each  of  the  sources  given  earlier.  The  information -gathering 
sequence  of  descriptive  literature,  specimen  analysis,  and  interview  proved 
effective . 
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IV.  Findings 

Meaningful  analyses  for  our  objectives  could  be  made  on  any  of  the  following 
levels; 

(1)  Viewing  each  compilation  activity  historically,  as  it  evolved 

to  satisfy  the  specific  user  needs  in  its  unique  area  of  coverage 
and  resolve  its  specific  data  handling  problems; 

(2)  Analyzing  all  data  compilation  activities  across  the  board  to 
study  the  larger  problems  which  they  face  in  common; 

(3)  Analyzing  groupings  of  compilations  that  were  distinct  and 
meaningful,  either  through  similarity  in  the  types  of  data 
which  they  handle,  in  compilation  formats,  or  in  the  various 
services  they  provide. 

All  three  levels  proved  useful  in  the  analysis. 

Compilation  Center  Activities.  A  listing  of  the  data  compilations  produced  by 
the  centers  interrogated  is  given  in  Appendix  B.  Appendix  C  contains  a  sum¬ 
mary  description  of  each  activity's  objectives  and  operational  procedures,  flow 
charts  of  its  document-  and  data-processing  operations  for  compilation  and 
output  development,  and  a  chart  describing  the  physical  arrangement  and  con¬ 
tent  of  its  principal  internal  files  for  accomplishing  these  operat’ins.  Figure  4 
presents  groupings  of  data  compilations  in  terms  of  output  and  coverage  classes 
that  were  considered  best  suited  for  more  detailed  analysis.  The  groups  es¬ 
tablished  are  by  no  means  mutually  exclusive.  They  are  used  to  delineate  broad 
views  of  the  variety  of  compilation  activities  we  identified  and  to  characterize 
the  groups.  Compilations  presenting  graphic  data,  which  essentially  provide 
an  infinite  number  of  values,  do  not  furnish  statistics  compatible  with  compil¬ 
ations  presenting  single  values.  Likewise,  compilations  comprising  cumulative 
tables  in  prescribed  substance -property  formats  are  amenable  to  quantitative 
analysis,  while  monographs  required  a  descriptive  level  of  analysis  to  accom¬ 
modate  varying  formats  of  presentation,  and  interpretive  discussions  associated 
with  the  values  given.  Efforts  were  made  to  display  these  intrinsic  differences 
in  the  master  worksheet  tabulation  in  Appendix  A.  The  occasional  "not 
applicables"  in  the  tabulations  indicate  that  this  was  not  always  possible. 

Other  significant  distinctions  beside  those  noted  by  Figure  4  exist  among  the 
data  compilation  activities.  Although  we  found  it  necessary  to  construct  the 
generalized  model  represented  by  the  worksheet  as  a  check-point  sequence 
in  our  information -gathering  operation,  we  did  not  expect  actual  data  compil¬ 
ation  activities  to  fit  any  single  model.  A  second  model  built  on  the  operating 
and  service  philosophies  associated  with  a  data  compilation  activity  proved 
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to  be  useful  in  analysis  of  different  patterns  actually  found,  and  should  have 
value  in  recognizing  the  design  requirements  of  future  data  compilation 
operations.  This  model  comprised  the  following  sets  of  servicing  and 
operating  objectives: 

Data  Acquisition  and  Screening  Objective 

■  Collect  and  disseminate  all  data  in  area  covered. 

■  Collect  all  data  and  disseminate  only  selected  data  in  area 
covered. 

■  Collect  and  disseminate  only  selected  data  in  area  covered. 

Information  Provided  Users 

•  References  to  sources  of  data. 

•  Usable  data  with  reference  to  information  sources. 

•  Usable  data  without  reference  to  information  sources. 

Type  of  User  Service 

■  Response  to  individual  inquiries. 

»  Periodic  dissemination  of  accumulated  information  or  data. 

Other  Pur  poses  of  Compilation  Activity 

a  Reduce  costs  for  the  user  to  obtain  reference  and/or  information. 

a  Increase  the  utility  of  data  by  conversion  to  a  more  useful  format. 

a  Generate  new  data  by  computation  or  extrapolations  with  data 
compiled. 

a  Conserve  scientific  manpower  by  eliminating  duplication  of 
evaluation  efforts. 

This  model  provides  an  insight  as  to  how  current  compilation  activities  are 
designed  and  operated  in  order  to  meet  the  needs  of  identified  users,  and  was 
incorporated  into  our  information  gathering  worksheets  and  analysis  procedures. 
The  following  findings  resulted: 

A.  Data  Acquisition  and  Screening 

Three  basic  types  of  compilation  activity  were  postulated  in  our  model: 

(1)  Collect  all  data  and  output  all  data,  with  only  moderate  evaluation, 

(2)  Collect  all  data  and  output  selected,  highly  evaluated  data,  and 
(8)  Collect  selected  data  and  output  selected,  highly  evaluated  data. 
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Activities  limited  to  Category  (1)  are  restricted  essentially  to  accumulating  and 
reformatting  functions.  Such  activities  are  characterized  by  high  output  volumes. 
An  example  of  this  type  of  operation  would  be  the  Wyandotte -ASTM  Spectral 
Data  Project,  which  collects  all  published  and  commercial  infrared  and  visible 
spectral  curves.  Wyandotte  then  codes  the  spectra  for  both  band  positions 
and  chemical  structure  and  places  the  coded  data  in  a  standardized  format 
for  dissemination  into  a  variety  of  publications,  which  range  from  tabular 
indices  to  computer  tapes. 

The  majority  of  data  compilation  groups  in  our  sample  fall  in  Category  <2). 

These  are  chemical  data  compilation  activities  which  screen  the  entire  open 
literature  for  measured  values,  organize  these  sources  of  information,  and 
then  evaluate  the  data  to  produce  a  "best  estimate, "  recommended  value. 

The  requirement  to  identify  the  original  source  document  in  the  data  evalua¬ 
tion  process  provides  the  raw  material  for  a  bibliographic  referencing  service, 
and  most  Category  (2)  activities  offer  such  services. 

None  of  the  activities  in  our  sample  fell  into  Category  (3). 

As  will  be  noted  in  Figure  5,  some  of  the  centers  in  our  sample  fall  into  more 
than  one  category.  The  compilation  level  can  be  influenced,  for  example,  by 
the  substances  and  properties  for  which  data  is  given.  JANAF,  for  example, 
presents  property  data  for  pure  substances  in  an  area  where  basic  data  and 
knowledge  of  substance -property  relationships  is  sophisticated.  They  store 
evaluated  fundamental  thermochemical  constants  and  mathematical  formulae 
on  magnetic  tapes  which  they  use  to  generate  extensive  tables  of  thermochemical 
functions.  On  the  other  hand,  the  Thermophysical  Properties  Research  Center 
at  Purdue  University  collects  data  on  the  thermo-transport  properties  of 
all  substances,  many  of  which  are  solids  in  which  trace  impurities  and  struc¬ 
tural  defects  have  major  influence  on  properties.  In  this  field,  rigorous 
establishment  of  new  values  by  computation  or  extrapolation  from  other  values 
is  difficult  or  impossible.  Considerable  empirical  data  and  judgemental 
appraisal  of  reported  measurements  must  be  assembled  before  an  expert  can 
ascertain  the  most  probable  value  for  the  purposes  of  identified  user  interests. 
Even  then,  TPRC  has  found  it  most  valuable  to  disseminate  in  a  concise  format 
all  the  reliable  empirical  data  used  to  ascertain  these  most  probable  values, 
because  of  the  composition  variances  in  the  substances  covered  and  similar 
variances  in  the  alternate  grades  of  a  material  available  to  the  user. 

The  inverse  relationship  between  the  volume  of  selected  values  presented  in 
the  Thermophysical  Properties  Research  Center  Data  Book,  compared  with 
the  JANAF  Therrr  ochemical  Tables,  probably  results  largely  from  this 
difference  in  substance -property  characterization.  The  TPRC  Data  Book 
inputs  and  displays  a  large  mass  of  data  with  the  final  objective  being  to 
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FIGURE  5. 

SERVICE  PHILOSOPHIES  AND  EMPHASES 
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output  a  single  best  estimate  curve  in  the  context  of  all  relevant  data.  In  con” 
trast  the  JANAF  Thermochemical  Tables  need  only  input  and  display  a  limited 
number  of  property  values  (which  are  documented  on  the  reverse  side  of  each 
page  of  tables)  with  the  output  resulting  in  an  extensive  table  of  thermochemical 
functions. 

B.  Information  Provided  User 


The  type  of  data  processed  is  not  the  sole  determinant  for  the  category  of 
data  compilation  output  which  will  be  established.  The  user  of  the  output  of 
the  compilation  is  also  a  significant  factor.  For  example,  the  design  engineer 
is  quite  willing  to  accept  a  single  best-estimate  value  for  the  heat  transfer 
coefficient  of  aluminum;  in  contrast,  a  research  scientist  planning  to  remeasure 
this  value  would  prefer  to  have  access  to  all  previously  measured  values. 

One  of  the  supplemental  information  outputs  most  frequently  associated  with 
the  centers  examined  is  a  bibliographic  reference  to  the  source  of  the  data. 

In  addition,  we  found  that  data  compilation  activities  normally  exhibit  an 
evolutionary  development.  In  the  history  of  individual  centers,  they  often 
first  provide  data  reference  services,  usually  in  the  form  of  bibliographies 
indexed  to  identify  the  substance  and  property  coverage  of  the  references 
included.  Data  file  construction  and  servicing  of  data  needs  of  users  appeared 
to  evolve  sequentially  as  the  "store”  became  larger  and  the  center  extracted 
and  organized  sufficient  amounts  of  data.  Our  study  efforts  included  only 
those  compilation  activities  which  provide  the  user  with  some  form  of  data. 

There  are  undoubtedly  activities  now  in  operation  whose  mission  is  to  pro¬ 
vide  data  services,  but  whose  operational  level  is  still  at  the  bibliographic 
level. 

Regardless  of  the  stage  of  development  for  the  type  of  user  service  provided, 
data  source  acquisition  and  processing  as  opposed  to  data  organization,  evalu¬ 
ation  and  dissemination,  is  performed  by  all  data  centers  in  our  study  sample. 

The  "young"  category  of  data-oriented  information  services  (many  of  which 
might  be  expected  to  evolve  from  the  bibliographic-servicing  to  the  data-servicing 
level)  is  therefore  a  potentially  important  activity. 

Texas  A&M,  the  National  Bureau  of  Standards  Circular  500,  and  the  JANAF 
project  stated  that  the  provision  of  references  to  sources  of  data  for  the  user 
is  a  secondary  objective  ■while  the  remainder  of  the  centers  consider  provision 
of  references  to  sources  of  data  one  of  their  primary  objectives.  Most  of  the 
centers  provide  the  user  with  usable  data  and  information  sources:  half  of  the 
centers  also  provide  the  user  with  usable  data  without  reference  to  information 
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sources.  In  our  discussion  with  the  directors  and  staff  of  these  data  compila¬ 
tion  activities,  they  indicated  that  they  devote  a  substantial  portion  of  their 
total  effort  to  acquiring  and  screening  sources  of  data.  The  magnitudes  of 
these  efforts  are  reflected  in  the  flow  charts  and  inter  al  file  descriptions  in 
Appendix  C  and  in  the  distribution  of  effort  tabulations  (presented  in  Appendix 
A,  pages  iii-iv). 

Many  activities  utilize  secondary  publications,  such  as  abstract  services,  in¬ 
dices,  etc.  ,  for  locating  and  acquiring  candidate  data  sources.  This  often 
contributes  to  a  substantial  time  delay  between  the  publication  of  a  new  value 
or  bit  of  data  and  its  availability  to  the  users  of  the  compilation.  In  some  cases 
data  centers  have  gone  to  great  lengths  to  formulate  screening  and  acquisition 
procedures  to  meet  their  specific  needs.  Many  times,  the  information  provid¬ 
ed  the  user  reflects  a  compromise  between  what  the  compilation  director 
would  like  to  provide  and  what  the  economics  of  the  project  will  permit. 

In  only  a  few  cases  are  the  content,  the  format  and  the  recording  media  of 
the  output  of  the  centers  all  tailored  to  the  needs  of  specifically  identified  users. 
The  X-Ray  Diffraction  project  disseminates  their  data  output  in  many  formats 
tailored  to  meet  many  different  users'  neecfe,  which  vary  from  computer  tapes  to 
tabular  books  and  3x5  cards.  They  also  have  internal  differences  in  the  same 
output  format.  The  Hanawalt-Davey  (tabular)  Index  lists  the  three  strongest 
lines,  their  relative  intensities,  the  chemical  formula  of  the  substance,  the 
name  of  the  substance  and  the  accession  number  of  the  data  card.  The  Fink 
Index  (tabular)  uses  the  eight  strongest  lines  in  ier  of  their  relative  intensi¬ 
ty  (cyclically  permutated),  the  name  of  the  substance  and  the  accession  number 
of  the  data  card.  Thus  this  data  activitiy  can  accommodate  many  different  types 
of  users  (large  corporation,  small  companies,  independent  users)  and  provides 
a  choice  of  technical  presentations  of  the  same  data.  The  ASTM -Wyandotte 
Spectral  Data  project  is  another  example  of  multiple -format  output. 


C.  Type  of  User  Service 


All  of  the  activities  investigated  give  as  a  primary  objective  the  periodic  dis¬ 
semination  of  accumulated  information  or  data  (see  Figure  5),  while  only  two 
activities  give  as  a  primary  objective  the  provision  of  quick  response  to  in  ¬ 
dividual  queries.  Six  chemical  data  activities  identify  quick  service  as  a 
secondary  objective  or  one  that  is  desirable  but  subordinated  by  other  consi¬ 
derations. 


We  believe  this  finding  does  not  imply  that  the  servicing  of  demand  inquiries  is 
a  secondary  need  in  chemical  data.  Rather,  probably  it  reflects  limitations  in 
the  manpower  and  monetary  resources  of  the  centers,  which  have  settled  for 
programs  of  periodic  publication,  guided  by  an  understanding  of  current  needs. 
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P.  Overlap  of  Coverage 

In  this  study,  we  found  some  overlap  between  centers  with  respect  to  substance - 
property  coverage.  In  the  field  of  physiochemical  and  thermodynamic  proper¬ 
ties,  the  overlap  appeared  primarily  in  the  basic  elements  or  the  hard  core  data. 
It  seems  apparent  that  this  overlap  is  desirable  and  convenient  from  the  user’s 
point  of  view.  Many  data  center  directors  shared  the  view  that  their  published 
documents  would  be  incomplete  without  this  basic  data.  It  would  be  reasonable  to 
assume  that  these  basic  data  could  be  amassed  or  disseminated  by  a  designated 
member  of  the  National  Chemical  Information  System.  This  should  decrease 
significantly  the  time  gap  and  duplicate  effort  in  evaluation  of  tiose  numerical 
property  values  appearing  in  the  primary  literature  that  find  use  in  the  critical 
compilations  of  individual  scientists  and  specialized  centers. 

A  typical  example  of  the  basic  element  overlap  would  be  the  element  oxygen 
(monoxygen)  in  the  gaseous  phase,  where  JANAF,  Texas  A  &  M,  and  the  Con¬ 
tributions  to  the  Data  on  Theoretical  Metallurgy  produced  by  Dr.  K.  K.  Kelley 
record  heat  content  (enthalpy,  heat  capacity,  entropy  and  others)  for  oxygen  at 
273°  -2000°  K,  while  the  National  Bureau  of  Standards  Circular  500  records  heat 
capacity  at  a  specific  temperature  within  the  same  range.  Using  carbon  (C)  in 
the  gaseous  phase,  JANAF,  Texas  A  &  M,  Contributions  to  the  Data  on  Theo¬ 
retical  Metallurgy,  and  Selected  Values  for  the  Thermodynamic  Properties  of 
Metals  and  Alloys  record  heat  content  or  enthalpy  in  similar  temperature  ranges. 
Similar  comparisons  can  be  shown  for  other  basic  elements  (nitrogen,  hydro¬ 
gen,  sulfur,  etc.  )  or  other  hard  core  data. 

Examinations  of  existing  compilations  indicate  that  the  amount  of  information 
stored  and/or  disseminated  varies  considerably  depending  on  the  nature  of  the 
data  and  of  the  user  populations.  Consequently,  generalized  treatments  applied 
to  all  cnemical  substances  and  all  property  values  should  be  approached  with 
caution.  It  is  not  adequate  just  to  determine  which  information  should  be  stored; 
the  level  (or  levels)  at  which  individual  data  elements  can  or  should  be  stored 
needs  to  be  related  to  usage  levels. 

E.  Nomenclature 

Our  findings  (see  Appendix  A,  pages  x-xii)  show  that  all  data  compilation  acti¬ 
vities  studied  index  the  information  under  basic  substance  and  property  para¬ 
meters  if  they  have  a  data  source  file.  Almost  all  of  the  systems  are  keyed 
to  CA  nomenclature  to  the  extent  that,  given  the  CA  name  for  a  substance,  the 
CA  coding  of  that  substance  could  be  applied  to  the  system. 

Exceptions  to  this  rule  occur  in  the  cases  of  the  Cryogenic  Data  Center,  the 
Thermodynamic  Properties  of  Metals  and  Alloys  Data  Center  at  Berkeley,  and 
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the  Atomic  and  Molecular  Processes  Information  Center.  These  activities  deal 
with  a  circumscribed  area  of  substances  in  a  very  intensive  fashion,  and  have 
elaborated  either  substance  description  or  property  description  to  improve  the 
exactness  of  retrieval. 

Further  exceptions  appear  in  the  cases  of  EPIC  and  TPRC,  which  deal  in  part 
with  commercial  substances  and  other  mixtures  and  systems  outside  the  CA 
Registry  nomenclature.  These  problems,  however,  are  soluble,  since  each 
activity  has  developed  structured  means  for  indexing  substance  coverage  of  the 
literature.  The  work  required  to  make  these  structures  compatible  appears 
reasonable. 

Key  words  for  property  identification  have  also  b*-en  established  by  most  chemi¬ 
cal  data  activities  and  should  prove  susceptible  to  s  andardization. 

A  third  coordinate  used  by  data  compilation  activities  in  indexing  data  sources 
for  retrieval  are  measurement  environment  parameters.  The  vocabularies  and 
systems  are  either  divergent  or  non-existent.  A  good  amount  of  work  appears 
necessary  to  establish  uniform  terminology  criteria  of  use  to  all  activities. 

F.  Data  Obsolescence 

To  date  apparently  little  study  has  been  given  to  determination  of  the  rate  at 
which  specific  types  of  data  become  obsolete.  No  centers  studied  reported  es¬ 
tablished  purging  elements  in  their  operations  procedures.  Obviously,  this 
property  of  data  significantly  influences  the  file  structure  selected  to  store  the 
data.  If  chronological  age  provides  a  satisfactory  criterion  for  file  purging, 
the  process  can  be  easily  automated.  However,  if  each  bit  of  data  has  to  be  in¬ 
dividually  examined,  the  task  will  require  highly  sophisticated  computer  routines, 
and/or  the  efforts  of  a  highly  skilled  data  specialist. 

G.  Purposes  of  Data  Compilation  Activities 

A  common  preoccupation  of  all  data  compilation  activities  studies  was  the  de¬ 
sire  to  improve  present  expertise  and  techniques  for  compressing  the  voluminous 
data  in  the  open  literature  into  packages  of  n  lageable  size  for  the  .scientific  and 
technical  community.  Only  shades  of  difference  existed  among  compilations  in 
their  emphases  of  other  purposes.  Some  of  them  are  mentioned  below.  In  hear¬ 
ings  before  Congress,  it  was  stated  that  the  Office  of  Standard  Reference  Data 
expected  to  save  $10  to  $100  for  each  dollar  expended  in  data  compilation  and 
evaluation  efforts.  This  was  estimated  to  result  in  a  total  savings  of  $100  to 
$200  million  per  year.  The  major  portion  of  the  savings  was  based  on  a  projec¬ 
ted  reduction  of  the  time  scientists  and  engineers  spend  in  searching  for  data. 
Although  this  is  obviously  a  significant  objective  of  data  compilation  activities. 
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it  was  not  the  primary  justification  advanced  by  center  directors  in  justifying 
their  existence.  Most  of  the  more  successful  compilations  are  associated  with 
scientific  or  technical  missions  that  provide  the  economic  justification  for  the 
effort.  An  example  is  the  petroleum  industry,  which  has  supported  the  API 
Project  44.  The  government  agencies  have  supported  numerous  Air  Force  and 
other  DoD  projects,  perhaps  because  firmer  estimates  of  operating  savings  re¬ 
sulting  from  the  application  of  more  reliable  data  can  be  established  as  offsets 
against  the  cost  of  maintaining  the  data  compilation  activity.  The  association  of 
a  data  compilation  activity  with  programmatic  research  or  application  appears 
to  result  in  a  more  consistent  funding  support,  a  dynamic  service-response 
posture,  and  a  technical  scope  tailored  to  specific  identified  uses.  This  sug¬ 
gests  that  system  design  concepts  which  weaken  the  association  between  pro¬ 
gram  missions  and  data  compilation  activities  should  be  approached  with  caution. 

H.  Expertise 

Capabilities  are  being  developed  by  many  data  compilation  activities  for  genera¬ 
ting  new  data  by  computations  and  other  projections  from  the  available  data  base 
where  basic  data  and  knowledge  of  substance  property  relationships  is  sophisti¬ 
cated.  The  capability  requires  highly  specialized  technical  people.  The  need  to 
conserve  this  competence  is  a  primary  concern  expressed  by  key  professionals 
in  many  of  the  data  compilation  activities  we  interviewed.  The  capacity  to  gene¬ 
rate  data  of  high  quality  and  broad  utility  finds  important  applications  in  many 
data  compilation  activities.  For  example,  the  JANAF  Thermochemical  Data  Proj¬ 
ect  stores  fundamental  thermochemical  constants,  mathematical  formulae  and 
computer  routines  which  are  used  to  generate  extensive  tables  of  thermochemical 
functions.  These  tables  are  published  and  distributed  internationally.  More  ad¬ 
vanced  processing  techniques  are  being  adopted  by  some  activities.  JANAF  pro¬ 
duces  punch  cards  and  magnetic  tapes  containing  the  fundamental  constants  with 
appropriate  automated  equipment.  The  ASTM  X-Ray  Powder  Diffraction  Data 
Project  stores  diffraction  data  on  magnetic  tape,  and  have  great  processing 
flexibility  for  producing  customized  compilations  and  indices  through  manipula¬ 
tion  of  stored  data.  They  are  also  experimenting  with  tne  generation  of  the  pow¬ 
der  patterns  by  using  a  computer -plotter  setup,  or  a  computer-driven  print-out 
system. 

In  another  area  of  data  generation,  the  Thermo  physical  Properties  Research 
Center  has  developed  the  expertise  to  be  able  to  estimate  unmeasured  property 
values  with  a  precision  adequate  to  guide  design  or  development  efforts  until 
the  value  could  be  experimentally  determined.  The  Thermodynamics  Research 
Center  utilizes  computer  techniques  for  screening  evaluated  data,  and  defining 
and  locating  their  best  fit.  The  resultants  are  usually  represented  through  im- 
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provements  in  the  accuracy  or  precision  of  the  constants  of  master  equations 
relevant  to  the  property.  TRC  has  also  developed  computational  methods  for 
defining  with  high  accuracy  end  precision  the  property -substance  curves  for 
all  compounds  in  some  homologous  hydrocarbon  series,  based  on  measurements 
of  reference-purity  samples  of  every  fourth  compound  of  the  series.  Since  the 
preparation  of  reference -purity  samples  is  an  extremely  laborious  process  de  ¬ 
manding  craft  skills  few  contemporary  scientists  are  willing  to  develop,  such 
methods  may  prove  critical  in  the  establishment  of  reference -quality  basic  data. 
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Key  Design  Considerations  for  the  Data  Processing  Element 
of  a  First-Generation  National  Chemical  Information  System 


Decentralized  Data  Compilation  Requirement. 

In  one  of  its  progress  reports  to  the  National  Science  Foundation,  Chemical  Abstracts 
Service  posed  the  questic-'  of  whether  its  information  system  should  produce  com¬ 
pendia  of  physical  propei  ties  information.  The  first  service  suggested  would  provide 
the  name  of  a  substance,  the  name  of  the  associated  physical  property,  and  the  liter¬ 
ature  reference  to  the  source  of  this  information.  A  proposed  later  service  would 
include  provision  of  this  information,  together  with  associated  data  necessary  for 
proper  interpretation  of  the  property  value  (e.  g.,  the  pressure  at  which  a  boiling 
point  was  measured.  ) 

Our  findings  suggest  that  an  alerting  service  comprising  the  simpler  of  the  two  cover¬ 
ages  described  by  Chemical  Abstracts  could  provide  a  major  relief  for  one  of  the 
largest  working  burdens  found  in  most  of  the  centers  studied.  This  is  the  process 
of  searching  for  relevant  documents  in  the  open  literature,  and  their  subsequent 
abstracting,  indexing,  and  coding.  A  great  reduction  of  essentially  duplicated  effort 
in  current  data-center  operations  appears  feasible  by  establishing  such  an  operation¬ 
al  capability,  and  coupling  it  to  compilation  centers  through  responsive  and  expedi¬ 
tious  linkages.  There  is  little  doubt  that  centers  would  welcome  such  a  linkage. 

Some  of  them  now  maintain  informal  arrangements  between  themselves  to  reduce 
overlapping  effort. 

The  evidence  of  high  scientific  and  processing  expertise  in  many  of  the  center 
operations  similarly  impressed  us  that  subject-specialized  data  centers  possess  a 
sensitivity  to  their  subject-matter  that  would  be  most  difficult  to  centralize.  They 
are  often  centers  of  scholarly  research  directed  toward  advancement  of  the  estab¬ 
lished  structures  of  the  field  itself.  The  data  activity  they  also  carry  on  comprises 
one  of  the  key  tools  for  this  research  activity. 

We  thus  suggest  that  the  design  thinking  for  the  data  component  of  the  National 
Chemical  Information  System  contemplate  a  multiplicity  of  specialized  compilation 
activities  suitably  linked  to  other  elements  of  the  System. 

Data  Source  Screening  Requirement. 

When  testifying  in  support  o.t'  H.  R.  16897,  Dr.  Donald  Hornig,  Director,  Office  of 
Science  and  Technology,  stated,  "The  Standard  Reference  Data  System  should  be 
considered  part  of  the  total  national  network  of  scientific  and  technical  information 
systems.  "  In  regard  to  the  question  of  the  interface  between  the  Standard  Refer¬ 
ence  Data  System  and  the  activities  of  the  American  Chemical  Society,  Dr.  Astin, 
Director  of  the  National  Bureau  of  Standards,  stated,  "For  example,  the  first  phase 
of  the  Standard  Reference  Data  problem  is  the  search  for  literature.  It  is  also  a 
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problem  that  is  common  with  the  broad  scientific  information  problem.  So  in 
improving  techniques  of  search  for  literature  and  in  making  indices  of  what  is 
in  the  literature,  we  (NSRDS  and  ACS)  have  a  common  interest.  In  this  we  are 
working  together  closely.  " 

As  indicated  previously,  our  findings  reaffirm  a  need  for  improved  methods  of 
screening  the  literature  to  identify  data  potentially  useful  to  data  compilation 
activities.  We  suggest  that  the  system  requirement  for  a  national  chemical  infor¬ 
mation  system  include  a  mechanism  for  screening  data  sources  and  selectively 
notifying  appropriate  data  compilations  when  a  new  or  revised  property  value 
appears  in  the  literature.  Possible  means  of  implementing  this  system  requirement 
include  prepi  ration  of  substance -property  indices  and  production  of  citation  indices. 
The  publication  of  citation  indices  is  amenable  to  automated  methods  and  would 
provide  a  valuable  tool  for  the  data  compiler. 

Fast  Source  Document  Processing  Requirement. 

As  noted  in  our  findings,  many  data  compilation  activities  are  currently  dissatis¬ 
fied  with  the  time  delays  encountered  in  using  secondary  sources  to  locate  data 
sources.  A  national  chemical  information  processor  should  acquire  source  docu¬ 
ments  and  process  them  in  a  matter  of  days  rather  than  months  as  is  the  current 
practice  of  many  indexing  services.  KWIC  indices  and  title  announcements  often 
are  not  adequate  indicators  of  the  data  content  of  articles,  reports,  etc.  Sped  al- 
ized  express  announcement  vehicles  appear  to  be  required. 

The  Data  Subsystem. 

In  the  total  context  of  the  Chemical  Information  System  operations,  v/e  would 
suggest  the  following  requirements  for  the  data  component  of  the  system: 

(1)  Fast  announcement  of  new  or  re -measured  values  for  the  properties 
of  chemical  substances. 

(2)  Continuous  maintenance  of  a  file  of  the  most  probable  values  for  a 
hard-core  set  of  properties  of  chemical  substances. 

(3)  Effective  division  of  work  res  risibilities  between  documentation 
or  information  system  operators  and  data  specialists. 

(4)  Maintenance  of  effective  communication  between  scientists  and 
engineers  pursuing  mission  objectives  and  the  operators  of  the  data 
compilation  activities  supporting  the  pursuit  of  these  specialized 
missions. 

(5)  Segregation  of  data  into  collections  or  files  of  manageable  volume 
so  that  realistic  procedures  can  be  established  for  screening  and 
purging. 
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(6)  Provision  of  a  workable  arrangement  for  test  and  evaluation  of  file 
integration  procedures  prior  to  total  integration  of  a  data  compilation 
activity  into  a  broader  document-data  processing  system. 

The  schematic  on  the  following  page  illustrates  a  system  configuration  which  would 

meet  these  requirements. 

Implementing  Requirements. 

These  suggestions  can  be  implemented  through  the  following  development  program: 

(1)  Differentiate  work  scope  of  the  data  component  of  the  National 
Chemical  Information  System  from  the  scope  of  other  data  systems 
(e.g.,  NSRDS,  CIDS,  and  specialized  data  compilation  activities). 

(2)  Design  effective  processing  linkages  between  relevant  document¬ 
processing  and  data-processing  elements  of  NCIS. 

(3)  Develop  criteria  and  procedures  for  rapid  and  effective  announcement 
of  new  or  re -measured  values  for  the  physical  and  chemical  pro¬ 
perties  of  chemical  substances. 

(4)  Develop  criteria  and  procedures  for  establishing  and  maintaining 
an  up-to-date  file  of  the  most  probable  values  for  a  "hard-core" 
set  of  properties  of  chemical  substances,  to  provide  an  effective 
operational  service  capability. 

(5)  Analyze  chemical  data  classes  into  manageable  collections  or 
files,  and  develop  realistic  criteria  and  procedures  for  data 
screening,  file  updating,  user  servicing,  and  file  purging. 

(6)  Develop  workable  criteria  and  procedures  for  evaluating  specialized 
document-  and  data-processing  activities  that  are  potential  elements 
of  NCIS,  and  workable  policies  and  procedures  for  integrating  them 
operationally. 

Next  Tasks  to  Undertake. 

In  its  development  of  the  chemical  data  subsystem,  we  recommend  that  the 

following  tasks  be  undertaken  next  by  the  Chemical  Information  Program: 

Task  #1:  Review  the  current  and  projected  requirements  and 

capabilities  (e.g.,  current  ACS  capabilities)  for  screening 
sources  of  data  and  notifying  data  compilers  and  other 
data  users  when  new  or  re-measured  values  for  the  pro¬ 
perties  of  chemical  substances  appear  in  the  literature; 
formulate  implementation  plans  for  such  services,  includ¬ 
ing  where  possible,  plans  for  early  prototype  tests. 


9ot«no§  Communication 

Washington,  O-  C-  SSOO  07 

NSF  C -47  8  Final  Report 
20  March  1967 


FIGURE  6. 

Schematic  of  a  Proposed  National  Chemical  Information  System 
Incorporating  Specialized  Data  Compilation  Activities 
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Task  #2:  Examine  and  determine  the  extent  of  correlation  between: 

1.  "real-world"  file  structuring  practices  as  re¬ 
vealed  by  our  data  compilation  analyses  and  CID6 
user  studies; 

2.  data  file  contents  and  structuring  as  enumerated 
in  the  Systems  Requirements  Description  for  a 
National  Chemical  Information  System;  and 

3.  optimum  (from  a  data  processing  viewpoint)  sizes 
and  structurings  as  determined  by  National  Bureau 
of  Standards  Grant  GN-455,  Goodyear  Aerospace 
Corporation  Contract  NSF  C-456,  and  University 
of  Pennsylvania  Institute  f  :>r  Cooperative  Research 
Contract  No.  DAI8-035-AMC-288,  etc. 

Task  #3:  Identify  the  preferred  contents  and  structuring  for  the  data 
component  of  the  National  Chemical  Information  System 
and  recommend  implementation  plans,  including  plans  for 
prototype  tests  of  file  construction  and  user  servicing. 

Task  #4:  Review  program  description  and  planning  documents 

(NSRDS,  COSATI,  etc.  )  and  identify  effective  interfaces 
between  the  data  component  of  the  National  Chemical 
Information  System  and  other  national  scientific  and 
technical  data  systems. 
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TABULATION  OF  WORKSHEET  DATA 

I,  OPERATIONAL  CONTEXT 
A.  Housing  Organization 

Name 

Electrical-Electronic.  Properties  Information  Center,  Hughes  Aircraft 
Company,  Culver  City,  California  (EPIC) 

Thermodynamics  Research  Center,  Texas  A  &  M  University,  College 
Station.  Texas  (A  &  M) 

Thermophysical  Properties  Research  Center,  Purdue  University 
Research  Park,  Lafayette,  Indiana  (TPRC) 

Joint  Army-Navy- Air  Force  Thermochemical  Tables,  Dow  Chemical 
Company,  Midland,  Michigan  (JANAF) 

The  Wyandotte -ASTM  Spectral  Data  Project,  Wyandotte  Chemicals 
Corporation,  Wyandotte,  Michigan  (ASTM) 

Thermodynamic  Properties  of  Metals  and  Alloys  Data  Center, 

University  of  California,  Berkeley,  California  (UC) 

Cryogenic  Data  Center,  Cryogenic  Engineering  Labroatory,  National 
Bureau  of  Standars,  Boulder,  Colorado  (CPC) 

Atomic  and  Molecular  Processes  Information  Center,  Oak  Ridge 
National  Laboratory,  Oak  Ridge,  Tennessee  (AM PIC) 

Chemical  Thermodynamic  Data  Group,  Thermochemistry  Section, 
Physical  Chemistry  Division,  Institute  of  Basic  Standards, 

National  Bureau  of  Standards  (NBS) 

Joint  Committee  on  Chemical  Analysis  by  Powder  Diffraction  Methods, 
American  Society  for  Testing  and  Materials,  Philadelphia  (X-RAY) 
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is.  Type  of  institution: 

a.  Governmental 
NBS 
CDC 
AMFIC 

c.  Educational 
UC 

TPRC 
A  &  M 


b.  Industrial 
JANAF 
EPIC 

d.  Professional  Societies 
X-RAY 
ASTM 


Print. ary  motivation  for  data  compilation  activities: 

a.  In- house  support  servicing  b.  External  servicing,  non-profit 


TPRC 

UC 

X-RAY 

CDC 

A  &  M 

ASTM 

AMPIC 

JANAF 

EPIC 

TPRC 

CDC 

NBS 

AMPIC 

Source  of  funding  for  data  compilation  activities: 

UC  -  10%  Industry 

A  &  M  - 

13%  Industry 

90%  Government 

35%  Government 

JANAF-  100%  Government 

37%  Sales 

15  %  from  Texas  A  &  M 

NBS  -  100%  Government 

TPRC  -  100%  Government 

ASTM  -  100%  Professional  Societies 

from  3ales  of  publications 

AM  PIC  -  100%  Government 


CDC  - 100%  Government 

X-RAY  -  100%  from  sales  of 
publications 

EPIC  -  100%  Government 


-A-iii 
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5.  Percentage  distribution  of  current  data  compilation  efforts  among  the 
following  processing  functions,  e.  g. , 

UC  -  (for  acquisition  of  data  sources 

5%  /  for  source  document  screening  and  evaluation 
{for  source  document  file  (library)  maintenance 

(for  data  screening  and  evaluation 
84%  (for  data  extraction 

10%  for  compilation  publication  and  dissemination 
1%  for  inquiry  handling 

A  &  M  -  5%  for  acquisition  of  data  sources 

10%  for  source  document  screening  and  evaluation 

55%  for  data  screening  and  evaluation 
5%  for  data  extraction 

5%  for  data  file  construction  and  maintenance 

10%  for  compilation  publication  and  dissemination 
~1%  for  inquiry  handling 

JA.NAF-  3%  for  acquisition  of  data  sources 

5%  for  source  document  screening  and  evaluation 
2%  .for  source  document  file  maintenance 

65%  for  data  screening  and  evaluation 
5%  for  data  extraction 

10%  for  data  file  construction  and  maintenance 

10%  for  compilation  publication  and  dissemination 

NBS  -  ["for  acquisition  of  data  sources 

70  |  for  source  document  screening  and  evaluation 
10%  'for  source  document  file  maintenance 

40%  for  data  screening  and  evaluation 
_  ffor  data  extraction 

(for  data  file  construction  and  maintenance 

10%  for  compilation  publication  and  dessemination 
5%  for  inquiry  handling 

TPRC  -  (for  acquisition  of  data  sources 

32%  <  for  source  document  screening  and  evaluation 
for  source  document  file  maintenance 

(for  data  screening  and  evaluation 
for  data  extraction 

for  compilation  publication  and  dissemination 
14%  for  inquiry  handling 
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I.  A.  5,  (cent. ) 


CDC-  ^for  acquisition  of  data  sources 

35%)  for  source  document  screening  and  evaluation 
Ifor  source  document  file  maintenance 

(for  data  screening  and  evaluation 
for  data  extraction 

for  data  file  construction  and  maintenance 

10%  for  compilation  publication  and  dissemination 
10%  for  inquiry  handling 
>1%  for  other  processing  functions 


X-RAY  - 


3% 

6% 


for 

for 


30%  for 
7%  for 
40%  for 

10%  for 
for 


acquisition  of  data  sources 

source  document  screening  and  evaluation 

data  screening  and  evaluation 
data  extraction 

data  file  construction  and  maintenance 

compilation  publication  and  dissemination 
other  processing  functions 


ASTM- 


EPIC 


90% 

<1% 


(for 
\for 

for  inquiry  handling 

(for 
33%  ifor 
(for 

(for 
33%  (for 


data  file  construction  and  maintenance 
compilation  publication  and  dissemination 


acquisition  of  data  sources 

source  document  screening  and  evaluation 

source  document  file  maintenance 

data  screening  and  evaluation 
data  extraction 


AM  PIC  - 


for  compilation  publication  and  dissemination 
33%  for  inquiry  handling 

for  other  processing  functions 

(for  acquisition  of  data  sources 
for  source  document  screening  and  evaluation 


60% 


^for  source  document  file  maintenance 


{for  data  screening  and  evaluation 
for  data  extraction 

for  data  file  construction  and  maintenance 


-A-iv- 


5% 

5% 


for  compilation  publication  and  dissemination 
for  inquiry  handling 
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II.  DATA  SOURCE  (LIBRARY)  COLLECTION  AND  PROCESSING 
—  ""  —  —  ■  ■  «  - ■»■■■» 

A.  Identification  and  Acquisition  of  Sources  of  Data 

1.  Which  of  the  following  are  screened  in  searching  for  candidate  document 
titles  or  data  sources?  * 

a.  Journal  Articles:  b.*  Catalog  Cards: 


uc  - 

(Approx.  ) 

5% 

CDC  -  (Approx. ) 

2% 

A  &  M  - 
JANAF  - 
NBS  - 

15% 

5% 

60% 

65% 

c. 

* 

Bibliographic  Citations 

in 

TPRC  - 

Indexed  documents: 

CDC  - 

45% 

UC  -  (Approx. ) 

2% 

X-RAY  - 

100% 

A  &  M  - 

20% 

EPIC  - 

65% 

NBS  - 

2-3% 

AMPIC  - 

90% 

TPRC  - 

24% 

CDC  - 

15% 

Index  and  Abstract  Publications: 

UC- 

(Approx.  ) 

85% 

e. 

Computerized  Tape  Indexes: 

A  &  M  - 

55% 

JANAF  - 

85% 

JANAF - 

5% 

NBS  - 

35% 

f. 

Punched  Card  Indexes: 

TPRC  - 
CDC  - 

30% 

20% 

None 

EPIC  - 

25% 

AMPIC  - 

10% 

4 

Others: 

UC  - 

(Approx.  ) 

tt% 

A  &  M  - 

10% 

JANAF  - 

10% 

NBS  - 

2% 

TPRC  - 

ltt%  (for  current  contents) 

EPIC  - 

5% 

*  X-RAY  and  ASTM  do  not  have  data  source  files. 


1 

I 

I 
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2,  Total  number  of  candidate  sources  identified  annually: 

UC-  1400 

JANAF  -  1500 

NBS  *  2500 

TPRC  -  ~9200 

CDC  -  9100 

EPIC  -  '  212000 
AM  PIC  -  3500 


3.  Which  of  the  following  data  sources  are  annually  indexed? 

a.  Journal  Articles:  b.  Published  Reports  other 


ttC-  (Approx.) 

60% 

than  Journal  Articles: 

A  AM  - 

a0-65% 

UC  -  (Approx.) 

10% 

JANAF  - 

90% 

A  A  M  - 

5% 

NBS  - 

85% 

JANAF  - 

10% 

TPRC  - 

76% 

NBS  - 

12% 

CDC  - 

60% 

TPRC  - 

24% 

EPIC  - 

76.8% 

CDC  - 

25% 

AMPIC  - 

75% 

EPIC  - 

12.6% 

AMPIC  - 

25% 

Unpublished  Laboratory  Reports: 

UC  -  (Approx.) 

A&M  - 

6% 

5% 

d.  Data  Sheets  from  other 
Compilations: 

CDC  - 

13% 

UC  -  (Approx.) 

4% 

A  &  M  - 

5% 

Customized  Data  Sheets: 

CDC  - 

2% 

NBS  -  (Approx.) 

3% 

EPIC  - 

7.4% 

f.  Others: 

EPIC  -  (Approx. )  3.2% 

4.  What  is  the  form  of  data  sources  currently  accessioned? 
a.  Hardcopy:  b.  Microcopy: 


UC  -  (Approx.) 

75% 

UC  -  (Approx.) 

25% 

AA  M  - 

100% 

NBS  - 

10% 

JANAF  - 

100% 

TPRC  - 

20% 

NBS  - 

90% 

CDC  - 

30% 

TPRC  - 

80% 

AMPIC  - 

10% 

CDC  - 

70% 

X-RAY  - 

100% 

ASTM  - 

100% 

EPIC  - 

100% 

AMPIC  - 

90% 

toi«no«  Communication 

Washington.  D.  C.  SOO  07 

NSF  C-478  Final  Report  -A~vii- 

20  March  1067 


5.  What  is  the  total  number  of  data  sources  indexed  to  date? 

UC-  12,000 

A  &  M  -  30,  000 

JANAF  -  26,  000 

NBS  -  55,  000 

TPRC  -  33,  700 

CDC  -  40,  000 

EPIC  -  27,  000 

AMPIC  -  6-7,000 

X-RAY  -  13,  000 

6.  What  is  the  period  covered  by  data  sources  indexed  to  date? 


uc  - 

from  1848 

A  &  M  - 

1840 

JANAF  - 

1925 

NBS  - 

1940  (expanded  in) 

TPRC  - 

1920- 1964 

CDC  - 

from  1950 

X-RAY  - 

1940 

ASTM  - 

unknown 

EPIC  - 

unknown 

AMPIC  - 

from  1950 

7.  What  percentage  of  the  total,  usable  data  sources  issued  during  the  period 
of  coverage  has  been  indexed? 


UC  - 

85% 

A  &  M  - 

100% 

JAHAF 

>90% 

NBS  - 

97% 

TPRC  - 

99% 

CDC  - 

60% 

EPIC  - 

unknown 

AMPIC  - 

95% 

8.  What  percentage  of  currently  issued,  usable  data  sources  is  indexed? 


UC  - 

95% 

A&  M  - 

100% 

JANAF  - 

>90% 

NBS  - 

90% 

TPRC  - 

99% 

CDC  - 

85% 

EPIC  - 

90% 

AMPIC  - 

100% 
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9.  What  percentage  of  current  year  accessions  were  issued  or  published  within 
the  past  year? 


UC  - 

~1% 

A&  M  - 

50% 

JANAF  - 

90% 

NBS  - 

87% 

TPRC  - 

50% 

CDC  - 

85% 

EPIC  - 

unknown 

AMPIC  - 

85-90% 

Data  Source  Screening  &  Evaluation 


1.  What  are  principal  criteria  for  screening  of  titles? 

UC  -  Thermodynamic  articles  on  metals  and  alloys  and 

phase  diagram  data. 

A&  M  Jud  g ment  by  a  specialist.  Chemical  Abstract  per 

entry  indicates  relevant  scope. 

JANAF  -  Measurement  of  property  of  interest  to  thermal,  lab. ; 

screening  by  weighted  key  words. 

NBS  -  Search  all  organics;  publish  only  those  with  two  or 

fewer  Carbon  Atoms;  extraction  from  original  documents. 

TPRC  -  Checking  for  property  coverage  and  duplication. 

CDC  -  Primary  temperature  ranges  below  -200° F. 

EPIC  -  Experimental  data  on  the  E.  P.  I.  C. 

AMPIC  -  Atomic  and  molecular  physics 


2.  What  percentage  of  initially  considered  titles  is  rejected  without  physical 
examination  of  the  data  source? 


UC  - 

75% 

A  &  M  - 

None 

JANAF  - 

>95% 

NBS  - 

8% 

TPRC  - 

NA 

CDC  - 

40% 

EPIC  - 

90% 

AMPIC  - 

None 
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3.  What  are  the  principal  criteria  for  screening  of  data  sources? 


uc  - 

Have  to  have  Thermodynamic  or  equilibrium  information. 

A  &.  M  - 

Pure  samples,  sound  experimental  measurements, 
reputation  of  author. 

JANAF  - 

New  measurements  sought,  remeasurement  of  property 
value  is  accepted. 

NBS  - 

Pertinence  -  Chemical  thermodynamic  data  on  pure 
substances  and  mild  electrolytic  solutions. 

TPRC  - 

Pertinence,  duplication. 

CDC  - 

Temperature  range  -  40%  is  screened  out  because  it  is 
probably  non-technical. 

EPIC  - 

Duplication;  whether  actual  data  is  there,  source  quality, 
suitable  subject  matter. 

AMPIC  - 

Retained  if  contains  useful  data. 

4.  What  percentage  of  data  sources  physically  examined  is  rejected? 


UC  - 

2% 

A  &  M  - 

None 

JANAF  - 

50% 

NBS  - 

2-3% 

TPRC  - 

50% 

CDC  - 

28% 

EPIC  - 

75% 

AMPIC  - 

10-20% 

iohmot  Communioatlon 

Washington,  O.  C  300  07 

NSF  C-478  Final  Report  -A-x- 

20  March  1067  * 


C.  Data  Source  Cataloging  and  Indexing 

1,  Indicate  the  extent  to  which  the  modes  of  cataloging  and  indexing  are  used. 

UC  -  a.  90  substance  and  23  properties  and  type  of  measure¬ 

ment  descriptors  used  to  date;  b.  none  added  annually; 
c.  average  of  6  substance  and  5  properties  and  type  of 
measurement  descriptors  assigned  to  each  document. 

A&M  -  a.  20,  000  substance  indication  and  60-70  property 

identification  descriptors  are  used  to  date,  140-150 
types  of  measurement  descriptions  are  given,  but  the 
system  is  not  organised  on  this  parameter  and  they 
nave  not  been  reduced  to  '‘terms";  b.  approximately 
200  substance,  2-3  property,  and  5  types  of  measurement 
are  added  annually;  c.  5-10  substance,  3-5  property, 
and  3-5  type  of  measurement  descriptors  are  assigned 
to  each  document;  d.  approximately  3,  000  file  entries 
per  year. 

JANAF  -  a.  10,  000  substance  identification,  70  property  identi¬ 

fication,  18  type  and  methods  of  measurement,  and  9 
substance  characterization  descriptors  are  used  to  date; 
b.  approximately  1,  000  substance  identification  descriptors 
are  added  per  year;  c.  an  average  of  5  descriptors  are 
assigned  to  each  source  document;  d.  7,  500  descriptor 
entries  annually. 

NBS  -  Approximately  40,  000  substance  assignments  used  to 

date;  properties  grouped  into  10  categories,  with  13 
subcategories;  type  of  measurement,  temperature 
range  etc. ,  not  reduced  to  “terms"  but  is  specified  in 
content;  10,  000  file  entries  per  year. 

TPRC  -  a.  45, 116  substance,  7  property,  7  treatment  of 

subject,  9  substance  state,  and  9  language  of  article 
descriptors  are  used  to  date;  b.  rate  of  addition 
of  substance  descriptors  is  not  available,  property, 
subject  treatment,  substance  state,  and  language 
descriptors  are  fixed  in  number;  c,  an  average  of  2.  5 
to  3  substance  and  property  descriptors  assigned  to  each 
source  document,  1  subject,  1  substance  state,  and  1 
language  descriptor;  d.  approximately  12,  650. 
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n.C.l.  (cont.) 

CDC  - 


X-RAY  - 
ASTM  - 
EPIC  - 


AMPIC  - 


a.  500  substance  and  350  property  descriptors  (includes 
variations  of  more  basic  propertied  550  conditions* 

b.  50  substance,  10  property,  and  50  condition  descriptors 
added  annually;  c.  3  to  4  substance,  1  to  2  property  and 

4  to  6  condition  descriptors  assigned  to  each  source 
document, 

N/A  .  . . . . . 

N/A 

a.  5100  substance  identification  and  56  property  identi¬ 
fication  descriptors  used  to  date-  b.  350  substance  de  - 
scriptors  added  annually,  properties  added  infrequently: 

c.  average  number  of  descriptors  for  each  document  not 
available;  d.  descriptor  entries  not  available. 

a.  unlimited  number  of  substance  identification  descriptors 
used  t.o  date,  no  property  identifications  used,  99  Process 
reactants  used  to  date:  b.  unlimited  substance  icent.  in 
current  annual  rate  of  addition  of  new  descriptors  but  no 
other  means  of  adding  descriptors  given;  c.  unlimited 
average  number  of  descriptors  assigned  to  each  source 
document  in  substance  identification,  no  other  descriptors 
assigned;  d.  unlimited  growth  of  descriptor  entries  for 
substance  ident. ,  no  other  growths  shown. 


2.  Indicate  which  means  are  used  to  index  the  substance  coverage  of  data  sources: 


A&M  - 

C.  A.  or  other  systematic  chemical  nomenclature  and 

OCT  Standard  Order  of  Arrangement. 

UC  - 

Chemical  fragment  codes. 

JANAF  - 

C.  A.  or  other  systematic  chemical  nomenclature. 

NBS  - 

C.  A.  or  other  systematic  chemical  nomenclature, 
order  of  arrangement  by  empirical  formula. 

standard 

TPRC  - 

C.  A.  or  other  systematic  chemical  nomenclature,  in 
combination  with  organization  rules  based  on  chemical 
composition  and  classes  of  substances. 

CDC  - 

Trivial  name  of  compounds,  compleses,  systems. 

etc. 

EPIC  - 

C.  A.  or  other  systematic  chemical  nomenclature. 

AMPIC  - 

C.  A.  or  other  systematic  chemical  nomenclature. 

X-RAY- 

N/A 

ASTM  - 

N/A 
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3«  What  nomenclature  standards,  thesauri,  code  manuals,  etc.  are  used 
to  control  indexing  of  substance  coverage? 


UC  - 
A&.M  - 
JANAF 
NBS  - 


index  cards,  controlled  vocabulary  using. 
IUPAC  and  CA  Recommendations, 
modified  Hill  Indexing  System, 

IUPAC,  Chemical  Abstracts. 


TPRC  - 
CDC  - 
X-RAY  - 
ASTM  - 
EPIC  - 
AMPIC  - 


Dictionary  of  Substances  -  numerical  code  designations. 
Characteristic  Coding  Guide. 

N/A 

N/A 

from  Trade  Name  to  Chemical  Name. 

no  control;  Index  all  molecules  of  5  atoms  and  under. 


4.  What  means  are  used  to  index  the  property  coverage  of  data  sources? 
UC  -  indexing  card,  controlled  vocabulary. 

A&M  -  as  originally  standardized  by  Rossini. 

AMPIC-  a  standardized  thesaurus. 


NBS  - 
T^PRC  - 
CDC  - 
X-RAY  - 
ASTM  - 
EPIC  - 

JANAF  - 


list  of  properties  -  10  catagories . 

list  of  7  properties  (actually  covers  13) 

own  terminology  with  hundred  relationships. 

N/A 

N/A 

Indexing  Guide  or  Glossary  of  Electronic  Properties 
(56  terms  and  400  Use  References) 

70  property  identification  key  words,  18  key  words 
indicating  measurement,  and  9  key  words  indicating 
the  state  or  condition  of  the  substance. 
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D.  Physical  Arrangement  of  Data  Source  Files  (Library)  See  Appendix  (1 


E.  Purging  of  Data  Source  File  (Library): 

1.  Indicate  the  method  of  purging  data  source  file  (library): 


uc  - 

file  is  not  purged. 

A&M  - 

no  data  source  file. 

JANAF  - 

file  is  not  purged. 

NBS  - 

file  is  not  purged. 

TPRC  - 

file  is  not  purged. 

CDC  - 

file  is  not  purged. 

X-RAY  - 

N/A 

ASTM  - 

N'A 

EPIC  - 

file  is  not  purged  except  for  errors  or  duplication. 

AM  PIC  - 

file  is  not  purged. 

F.  If  the  Data  Source  File  Supports  More  Than  One  Data  File  or  Compilation, 
Identify  the  File  and  Compilation  (s)  See  Appendix  B_ 


G.  Flow  Chart  the  Steps  Involved  in  Data  Source  File  Construction  and  Maintenance 
See  Appendix 


•oitnot  Communication 

Washington,  D.  C.  200  07 

C-478  Final  Report  -A-xiv-  ■■ 

20  March  1967 

in.  DATA  FILE  CONSTRUCTION 
A.  Screening  and  Evaluation 

1.  What  criteria  are  used  to  select  data  for  inclusion  in  the  data  file? 

a.  None,  except  that  the  data  fall  within  the  scope  of  the  data  file  and 
are  from  an  apparently  reliable  source. 

A  &  M 
JANAF 
NB3 
ASTM 

b.  Data  which  meets  the  file's  criteria  for  substance  purity,  mea¬ 
surement  reliability,  etc. 

X-RAY 

c.  Critically  qualified  data  which  meet  the  "one  best  estimate  " 
value  for  the  property. 

CDC 

X-RAY 

d.  Not  applicable. 

UC 

TPRC 
EPIC 
AM  PIC 

2.  When  is  data  screened  for  exclusion  from  the  data  file? 

a.  Prior  to  extraction  from  the  source  document. 

JANAF 

ASTM 

b.  Subsequent  to  extraction  from  the  source  document. 

CDC 

X-RAY 

C  Not  applicable. 

UC 

A  &  M 

NBS 

TPRC 

EPIC 

AMPIC 
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A.  (cont'd.  ) 

3.  What  percentage  of  source  documents  screened  provide  data  useful 
in  data  file  construction? 

JANAF  -  >90% 

NBS  -  95% 

CDC  -  25% 

X-RAY  -  65% 

ASTM  -  >95% 

B.  File  Accumulation  and  Purging 

1.  What  is  the  earliest  published  date  covered  by  data  items  processed 
to  present? 

A  &  M  -  1840 
JANAF  -  1925 
NBS  -  1780 
CDC  -  1960 
X-RAY  -  1940 
ASTM -1949 

2.  What  is  'he  total  number  of  data  sources  processed  to  date,  i.  e. , 
what  is  the  total  number  of  documents  or  data  sources  from  which 
data  have  been  extracted? 

A  &  M  -  27,  000 
JANAF  -25,  000 
NBS  -  55,  000 
X-RAY  -  13,  500 
ASTM  (JO  -  95,  000 
ASTM  &Y>  -25,  000 
ASTM  -  3,  300 
(mass  spec.) 

3.  What  percentage  of  potentially  useful  data  sources  (see  item  II -A -6) 
has  been  screened  for  processing  of  data  into  the  file? 

A  &  M  -  10% 

JANAF  -  100% 

NBS  -  -80% 

CDC  ••  >95% 

X-RAY  -  100% 

ASTM  -  100% 
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B.  (cont*d. ) 

4.  What  combinations  of  substances  and  properties  are  normally  found 

in  source,  documents? 

a.  Single  Substance: 

A  &  M  -  20% 

JANAF -  70% 

NBS  -  30% 

X-  RAY  -  70% 

ASTM  -  80% 

b.  Multiple  Substances: 

A  &  M  -  80% 

JANAF  -  30% 

NBS  -  70% 

X-RAY  -  30% 

ASTM  -  20% 

c.  Single  Property: 

A  &  M  -  40% 

JANAF  -  90% 

NBS -  60% 

X-RAY  -  - 
ASTM  - 100% 

d.  Multiple  Properties: 

A  &  M  -  60% 

JANAF  -  10% 

NBS  -  40% 

X-RAY  -100% 

ASTM  -  - 

5.  What  is  the  current  annual  rate  of  processing  data  into  the  file? 

A  &  M:  700  additional  source  items  included;  approximately  200  addi  ¬ 
tional  substances  included;  2-3  additional  properties;  500  new 
values  and/or  properties  not  previously  included;  2,  500  addi¬ 
tional  values  for  substances  and  properties  previously  included. 

JANAF:  1.  500  additional  source  items  included;  approximately  1,000 
additional  substances  included;  no  additional  properties  inclu¬ 
ded,  number  of  additional  values  not  available. 
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B.  (cont'd. ) 

MBS:  2,  200-2,  500  additional  source  items  included;  1,  000  additional 
substances  included;  no  additional  properties  included;  1,  500 
cards  of  new  values;  8,  500  cards  of  additional  or  updated  values 
for  substances  included, 

X-RAY;  1,  200  additional  substances  cov  ed. 

ASTM:  10,  000  Infrared  spectra  per  year;  4,  000  Ultraviolet  and 
visible  spectra  per  year. 


6.  What  percentage  of  data  extracted  is  in  each  of  the  following  forms; 

a.  Digital  or  alphanumeric 

A  &  M  -  100% 

JANAF  -  85% 

NBS -  70% 

TPRC  -  20% 

b.  Graphic 

JANAF  -  10% 

TPRC  -  80% 

X-RAY  -  100% 

ASTM  -  100% 

c.  Formula 

JANAF  -  5% 

NBS  -  15% 


d.  Other 

NBS  -  15% 


7,  What  is  the  fr  equency  of  updating  of  the  data  file? 

a.  Continuously  b.  N/A 

A  &  M  TPRC 

JANAF 
NBS 
CDC 
X  RAY 
ASTM 
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B.  (coat'd.) 

8.  How  are  data  files  purged? 

a.  Files  are  not  purged. 

NBS 

b.  Property  values  are  purged  when  more  reliable  values  become 
available. 

CDC 

X-RAY 

ASTM 

c.  Files  are  retired  when  a  data  compilation  is  published. 

JAISfAF 

d.  Not  applic  able  . 

A  &  M 
TPRC 

C.  Chemicai  Substance.  Coverage 

1.  What  is  the  intended  substance  coverage  for  this  Tie? 

A  &  M:  API  pure  hydrocarbons  and  related  compounds: 

TRC(MCA)  -  primarily  organic  and  inorganic  compounds 
outside  API  and  of  inter- e.st  fo  the  chemical 
Indus -ry 

JANAF:  All  chemical  compounds  of  interest  to  Dow  Chemical  Company 

NBS:  All  inorganic  (pure)  substances  and  all  pure  organic  substances 
and  well-defined  mixtures  thereof. 

TPRC:  All  substances 

CDC:  Cryogenic  fluids  (light  hydrocarbons,  atmospheric  gases, 
rare  gases) 

X-RAY:  Inorganic  and  organic  which  will  produce  diffraction  pat¬ 
terns;  inorganic  =  metals,  alloys,  minerals. 

ASTM:  Infrared,  Ultraviolet  spectroscopy  and  Mass  Spectrometry; 
Mostly  organic  compounds 
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C,  (contact.  ) 

2.  What  is  the  total  number.,  to  date,  if  chemical  substances  included  in 


this  data  file  ? 

A  &.  M: 

20,  000 

JANAF: 

10,  000 

NBS: 

40,000 

TPRC: 

42,  900 

X-RAY: 

13, 500 

ASTM- 

IR: 

95,  000 

UV: 

25,  000 

Mass  spec.: 

3,  300 

3.  What  is  the  distribution  of  coverage  among  the  following  groups  of 

chemical  substances? 

A  &.  M:  2%  single  chemical  elements;  10%  inorganic  compounds; 

87%  organic  compounds;  less  than  1%  organic  mixtures  and 
compounds. 

JANAF:  Less  than  5%  chemical  elements;  47.  5%  inorganic  compounds 
mixtures  and  systems;  47.  5%  organic  compounds,  mixtures 
and  systems. 

NBS:  4%  chemical  elements;  3%  single  chennca!  elements;  1%  ele¬ 
mental  mixtures  and  systems;  40%  inorganic  compounds, 
mixtures  and  systems;  25%  inorganic  compounds;  15%  inor¬ 
ganic  mixtures  and  systems;  56%  organic  compounds,  mix¬ 
tures  and  systems;  50%  organic  compounds;  6%  organic  mix¬ 
tures  and  systems. 

TPRC:  N/A 

X-RAY:  26%  chemical  elements  (2%  single  chemical  elements, 

98%  elemental  mixtures  and  systems  >;  37%  inorganic 
compounds,  mixtures  and  systems  <'70%  inorganic  com¬ 
pounds,  30%  inorganic  mixtures  and  systems);  37%  orga¬ 
nic  compounds,  mixtures  and  systems  (90%  organic  com¬ 
pounds,  10%  organic  mixtures  and  systems). 

ASTM:  10%  inorganic  compounds,  mixtures  and  systems;  S0% 
organic  compounds,  mixtures  and  systems. 
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D.  Property  Coverage 

1.  What  is  the  intended  property  coverage  for  this  file? 

A  &  M:  Simple  physical  properties.,  theimochemicai  data,  ideal 
gas  thermal  data,  viscosity. 

JANAF:  Thermochemical  and  related  fundamental  properties. 

NBS:  Chemical  thermodynamic  properties. 

TPRC:  N/A 

CDC:  Thermodynamic  transport  ^PVTi  relationships,  specific 
heat  and  entropy,  dielectric  constant. 

X-RAY:  Inter-planar  spacings,  relative  intensities,  radiation  source, 
wave  length,  filter,  t  ry stallographer  system,  space  group, 
lattice  parameters,  interaxial  angles,  chemical  formula, 
number  of  atoms  per  unit  structure,  indices  of  refraction, 
measured  density,  melting  point,  color,  hardness,  luster, 
chemical  analysis,  source,  heat  treatment,  temperature, 
name. 

ASTM:  Infrared  and  Ultraviolet  spectral  data;  mass  spectral  data. 

2.  What  is  the  total  number,  to  date,  of  properties  included  in  this 


data  file? 

A  &  M: 

60-70 

JANAF: 

70 

NBS: 

95 

TPRC: 

- 

CDC: 

15 

X-RAY: 

20-2  5 

ASTM- 

IR: 

95,000 

UV: 

25,  000 

Mass  spec: 

3,  300 

3.  How  many  new  properties  are  annually  added  to  the  file? 

A  &  M:  2-3 

JANAF:  0 

NBS:  0 

TPRC: 

CDC:  2 

X-RAY:  <  1% 

ASTM- 

IR:  30-40 

UV :  15-25 

Mas  spec :  15-25 


I 

I 

I 

I 

I 

I 

I 

I 


I 

I 

I 
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D,  (cont'd.) 

4.  What  is  the  average  number  of  values  per  property-substance  in¬ 
cluded  in  the  file? 

A  &  M:  25 

JANAF:  N/A 

NBS:  25 

TPRC:  N/A 

X-RAY:  15-20 

ASTM:  30-40 

F.  File  Formatting 

1.  Is  the  source  document  format  retained  or  is  the  format  converted  to 
a  standardized  format? 

a.  Format  retained: 

JANAF  (90%) 

b.  Format  converted: 

A  &  M 

JANAF  (10%) 

NBS 

CDC 

X-RAY 

ASTM 

c.  Not  applicable: 

TPRC 

2,  What  is  the  current  volume  of  the  file? 

a.  File  units: 

A  &  M:  200,  000 

JANAF:  8, 000 

NBS:  —200, 000 

b.  Data  records: 

X-RAY:  in  16th  supplement 

c.  Not  applicable: 

TPRC 

CDC 

ASTM 
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IV.  USER  SERVICING 


-A-xxii- 


Servlce  Philosophies  and  Emphases 
1. 


Kow  are  the  following  service  features  rated  in  your  current 
operations?  (see  figure  5) 


2. 


Who  are  the  users  of  the  products  of  this  data  compilation 
activity  ? 

Basic  Sc 

ientists 

Applied  Technologists 

UC 

40% 

60% 

A  &  M 

50% 

50% 

JANAF 

25% 

75% 

NBS 

40% 

60% 

TFRC 

15% 

80% 

5%  (mgmt) 

CDC 

20% 

80% 

X-Ray 

20% 

80% 

ASTM 

30% 

70% 

EPIC 

50% 

50% 

AM  PIC 

75% 

25% 

B.  Types  of  Output  Prepared 
1. 


Indicate  the  distribution  of  volume  of  output: 

Etta  source  reference 
vflticut  ctta  attracts 

Etta  extracts  wifeai 
ctta  source  references 

Etta  extracts  ad 
ctta  source  references 

UC 

100% 

A  &  M 

<1% 

99% 

JANAF 

100% 

NBS 

5% 

95% 

TPRC 

NA 

CDC 

30% 

70% 

X  -  Ray 

90  -  100% 

ASTM 

80% 

20% 

EPIC 

85% 

15% 

AM  PIC 

100% 
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2.  Indicate  the  distribution  of  output  according  to  the  type 
of  requirement: 


Generated  m  Response 
to  Inquiries 


Generated  for 
Periodic  Publication 


UC 

1% 

99% 

A  &  M 

100% 

JANAF 

100% 

NBS 

2% 

98% 

TPRC 

NA 

CDC 

10% 

85% 

X  -  Ray 

>89% 

ASTM 

99% 

EPIC 

~60% 

—40% 

AM  PIC 

100% 

5%  (other) 


C.  Usage  of  Data  Source  Files  (Library) 

1.  Indicate  the  frequency  of  usage  of  the  data  source  file  (library). 


UC  -  10  searches  per  year  in  response  to  external  queries  for 
for  document  references  or  to  identify  sources  of  data  re¬ 
quired  to  answer  data  queries;  daily  searches  by  staff  to 
identify  sources  of  data  required  to  prepare  or  update  data 
compilations. 

A  &  M  -100  searches  in  response  to  external  queries  for  document 
references  and/or  data;  continual  searches  by  staff  to  iden¬ 
tify  sources  of  data  required  to  prepare  or  update  data  com¬ 
pilations  (the  same  file  is  used  for  IV  C  I.  and  IV  D). 

JANAF  -4  magnetic  tape  searches  and  100  manual  searches  of  file 
folders  are  made  by  staff  to  identify  sources  of  data  re¬ 
quired  to  prepare  or  update  data  compilations. 

NBS  -  20  searches  per  year  in  response  to  external  queries  for 

document  references,  continuous  searches  by  staff  to  iden¬ 
tify  sources  of  data  required  to  prepare  or  update  data 
compilations. 
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C.  (con'fc) 

TPRC  -  Daily  manual  searches  by  staff  in  response  to  external 
queries  for  document  references  or  to  identify  sources 
of  data  required  to  answer  data  queries;  periodic  searches 
every  12  to  18  months  for  preparation  or  updating  of  a 
bibliography  or  accession  list  for  external  distribution 
{Retrieval  Guide);  continuous  searches  by  staff  to  identi¬ 
fy  sources  of  data  required  to  prepare  or  update  data 
compilations. 

CDC  -  Approximately  90  searches  in  response  to  external  queries 
extensive  bibliography  of  references,  approximately  200 
searches  by  staff  to  identify  sources  of  data  required  to 
answer  specific  data  queries,  approximately  10  to  15 
searches  by  staff  to  identify  sources  of  data  required  to 
prepare  or  update  data  compilations. 

X-Ray  -  not  applicable 

ASTM  -  not  applicable 

EPIC  -  75  -  80  searches  per  year  in  response  to  external  queries  for 
documents  references;  1  search  per  year  for  preparation 
or  updating  of  a  bibliography  or  accession  list  for  external 
distribution;  7  or  8  searches  by  staff  to  identify  sources  of 
data  required  to  prepare  or  update  data  compilation. 

AMPIC  -  30  searches  per  month  in  response  to  external  queries  for 
document  references;  5  searches  per  year  for  periodic 
searches  for  preparation  or  updating  of  a  bibliography  or 
accession  list  for  external  distribution;  no  searches  by 
staff  to  identify  sources,  no  searches  by  staff  to  identify 
sources  of  data. 
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2.  List  accession  lists  or  bibliographies  prepared  and  disseminated,  frequency 
of  publication,  forms  distributed  and  type  of  Indexing, 


Accession  List 
or  Bibliography 

Frequency  of 
Publication 
or  updating 

Forma 

Distributed 

Type  of  Indexing 

uc  - 

N/A 

A&M 

IuPAC  Bibliograftf' 

annual 

hardcopy 

substance - 
property  index 
&  guide 

JANAF  - 

N/A 

NBS  - 

IuPAC  Bibliography 

yearly 

hardcopy 

substance - 
property  index 
&  guide 

CDC  - 

Current  Awareness 
Service 

weekly 

hardcopy 

broad  subject; 
catalogue 

Bibliographies 

or 

irregularily 

on  request 

hardcopy 

hardcopy 

all  on  one  subj. 
(by  product  of 
compilation  efforts) 

on  topics  rajuBe'Bi 

Accession  Lists 

monthly 

hardcopy 

none 

X-RAY  - 

N/A 

ASTM  - 

N/A 

EPIC  - 

Interim  Reports 

Bibliographies 

in  respcnseto 
inquiries 

hardcopy 

hardcopy 

substance - 
prope  rty 

substance  - 
property 

in  r  expense  to 
inquiries 

AM  PIC  - 

Bibliographies 

semi-annual 

hardcopy 

reactants  and 
processes 

TPRC  - 

Retrieval  Guide  to  12-18  months 

Thermophysical 

Properties  Research 

Literature 

Bound 

Volumes 

substance - 
property,  type  of 
subject  ’.re  it- 
ment 

Bibliographies 

on  request 

Computer 

Printout 

substance - 
property  speci¬ 
fied 
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4.  Is  the  format  of  the  data  source  file  (library)  normally  retained  or  modified 
for  bibliography? 


uc  - 

N/A 

A&M  - 

separate  bibliography  cards  -  no  data  source  file  per  se 

JANAF  - 

N/A 

MBS  - 

modified 

TPRC  - 

retained 

CDC  - 

retained 

X-RAY  - 

N/h 

ASTM  - 

N/A 

EPIC  - 

retained 

AM  PIC  - 

retained 

D.  Usage  of  Data  File 

UC  -  uc  - 

N/A 

A&M  - 

100  searches  in  response  to  external  queries  for  docu¬ 
ment  references  and/or  data;  continual  searches  by  staff 
to  identify  sources  of  data  required  to  prepare  or  update 
data  compilations. 

JANAF  - 

Data  file  not  used  for  searches;  it  is  part  of  a  conplaticn  scheme. 

NBS  - 

200  searches  per  year  to  obtain  data  for  data  queries;  continu¬ 
ous  searches  to  obtain  data  to  prepare  or  update  compilations . 

CDC  - 

Approximately  35  searches  to  obtain  data  for  dataqueries,  10 
searches  tc  obtain  data  to  prepare  or  update  compilations. 

TPRC  - 

N/A 

X-Ray  - 

Very  few  searches  to  obtain  data  for  data  <-  eries;  searches  to 
obtain  data  required  to  prepare  or  update  data  compilation 
would  only  take  place  when  converting  cards  to  Book  Form  and 
when  replacing  an  old  pattern  with  a  new  one. 

ASTM  - 

Once  a  year  for  each  tabular  publication:  constant  merging. 

EPIC  - 

N/A 

AM  PIC  - 

N/A 

I 
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E.  Publication  Statistics  {See  Appendix  B) 


F.  Screening  and  Evaluation  for  Compilations 


L  What  criteria  are  used  to  select  data  for  inclusion  in  the  Compilation(s)? 


UC  - 


A&M  - 


JANAF  - 

NBS  - 


Data  which  meet  the  compilation's  criteria  for  substance 
purity,  measurement  reliability,  etc. ;  critically  qualified 
"one  best  value", (refers  to  Selected  Values  of  Thermodynamic 
Properties  of  Metals  &  Alloys,  and  Supplements) 

Critically  qualified  "one  best  value"( refers  to  API  Selected  Values 
of  Properties  of  Hydrocarbons  and  Related  Compounds,  and  TRC 
Selected  Values  of  Properties  of  Chemical  Compounds? 

Data  which  meet  the  compilation's  criteria  for  substance  purity, 
measurement  reliability,  etc;  maximum  of  3  spectra  are  included 
for  each  substance  (refers  to  API  and  TRC  Selected  Spectral  Data 
Catalogues) 

Critically  qualified  "one  best  Value"  (refers  to  JANAF  Thermo - 
chemical  Tables) 

Critically  qualified  "one  best  value"  (refers  to  Selected  Values 
of  Chemical  Thermodynamic  Properties  and  NBS  Technical  Notes) 


TPRC  - 

X-RAY  - 


ASTM  - 


EPIC  - 


None,  except  that  the  data  fall  within  the  coverage  of  the  data 
compilation  and  are  from  a  reliable  source;  critically  qualified 
"recommended  values"  (refers  to  TPRC  Data  Book) 

Data  which  meet  the  compilation's  criteria  for  substance  purity, 
measurement  reliability,  etc.  and  some  critically  qualified 
"one  best  value"  data  (see  Appendix  B  for  reference) 

None,  except  that  the  data  fall  within  *he  ^overage  of  the  data 
compilation  and  are  from  an  apparent*,  reliable  source  (refers  to 
Infrared,  Ultraviolet  and  Mass  Spectral  Data  Indexes) 

Data  which  meet  the  compilations  criteria  for  substance  purity, 
measurement  reliability,  etc.  ;  primary  sources  only  (refers  to 
Monographs) 


AM  PTC  -  N/A 

CDC  -  Data  which  meet  the  compilations  criteria  for  substance  purity, 

measurement  reliability,  etc.  for  preliminary  compilations;  Criti¬ 
cally  qualified  "one  best  value"  (refers  to  Monographs) 
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.  milted  from  the  data  file  is  normally  accepted  for 
2.  What  percentage  of  data  puiiea  irom 

inclusion  in  the  data  compilation  ? 

UC  -  N/A  _ 


uc  - 

AfcM 


Approximately  25%  (refers  to  API  Selected  Values  and  TRC 
Selected  Values) 


N/A  {refers  to  API  and  TRC  Selected  Spectral  Data  Catalogues) 

JANAF  '  100% 

NBS  -  Not  available 

TPRC  -  N/A 

X-RAY  -  100% 

ASTM  -  ioo% 

EPIC  -  N/A 

AMPIC  -  N/A 

CDC  -  N/A 

3  What  percentage  of  potentially  useful  data  items  in  the  data  file  have  been 
considered  for  inclusion  in  the  data  compilation  • 


UC  -  iw-/v 

A&M  .  100%  (API  -  Selected_Values  and  TRC  -  SelecJedValues) 

N/A  (API  and  TRC  Selected  Spectral  Data  Catalogues) 

JANAF  -  >95% 

NBS  -  90% 

TPRC  -  100% 

X-RAY  -  Not  available 

ASTM  -  Not  available 

EPIC  ■  100% 

AMPIC  -  Not  available 

Not  available 


100% 


A&M  - 


JANAF  - 
NBS  - 
TPRC  - 


ASTM  - 
EPIC  - 


I 

I 

I 

I 

I 

i 

I 

I 

I 

I 

I 

I 

I 

1 

I 

I 

I 

I 

I 


CDC  - 
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4.  What  is  the  current  annual  rate  of  issuing  data  in  compilations? 

UC  -  Some  new  substances  included  each  year,  some  revised  or 

updated  annually. 

A&M  -  N/A  (for  all  publications) 

JANAF  -  150  additional  source  items  included,  100  additional  substances 

included,  no  additional  properties  included,  values  updated  for 
100  substances. 


NBS  - 


Complete  revision  is  in  process. 


TPRC  - 


X-RAY- 
ASTM  - 
EPIC  - 
AMPIC  - 
CDC  - 


1,  391  additional  source  items  included,  3,  725  additional  specimens 
included,  no  additional  properties,  1086  pages  on  new  substances: 
and/or  properties  included,  and  642  additional  or  updated  pages 
for  substances  and  properties  previously  covered. 

Same  as  data  file  -  1,  200  additional  substances  covered. 

Not  available 

N/A 

N/A 

N/A 


G.  Chemical  Substance  Coverage  of  Compilations 

1.  What  is  the  intended  substance  coverage  of  this  compilation? 


UC  -  metals  and  alloys. 

A&M  -  Pure  hydrocarbons  and  related  compounds  (such  as  oxygen  and 

sulfur  derivatives  of  hydrocarbons)  (refers  to  API  Selected  Values) 


Pure  organic  and  inorganic  substances  outside  of  the  hydrocarbon 
(API)  area  (refers  to  TRC  S^L&rted  Values) 

API  Spectra  cover  Hydrocarbons  and  related  compounds  MCA 
spectra  cover  other  organic  and  inorganic  substances  -  coverage 
is  not  complete  (refers  to  API  and  TRC  Selected  Spectra  Data) 

JANAF  -  light  elements  and  their  oxides,  fluorides,  etc. 

NBS  -  all  inorganic  andorganic  substances  with  molecules  of  2  or  less  atoms. 

TPRC  -  all  substances 


X-RAY  - 
EPIC  - 

ASTM  - 
AMPIC  - 
CDC  - 


Not  available 

super  conductors,  semiconductors,  insulators,  metals,  ferroelec- 
trics.  ferromagnetics,  ferrites,  electroluminescents,  thermo araUas. 
Not  available 
Not  available 
Cryogenic  Substances. 
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2.  What  is  the  total  number,  to  date,  of  chemical  substances  included  in  this 
data  compilation? 

UC  -  Approximately  75  elements,  300-400  alloys 

A&M  -  3375  {API  -  Selected  Values) 

2000  {TEC  -  Selected  Values) 

not  available  (API-TRC  Selected  Spectral  Data) 

JANAF  -  625  (estimated) 

NBS  -  4,  500  chemical  compounds 

TPRC  -  7,  700  specimens,  substances  not  applicable. 

X-RAY  -  13500  (8500  inorganic,  5000  organic) 

ASTM  -  IR-95,  000;  UV-25,  000;  Mass  Spec-3,  300 

EPIC  -  Approximately  50 

AMPIC  -  N/A 

CDC  -  20  substances  (principal  fluids  and  common  mixtures  of  these 

fluids)  plus  metallic  elements  and  selected  alloys. 

3.  What  is  the  distribution  of  coverage  among  the  following  groups  of  chemical 

substances?  • 

UC  -  20%  single  chemical  elements,  80%  elemental  mixtures  and  systems. 

A&M  -  100%  inorganic  compounds  (API-Selected  Values) 

<2%  chemical  elements,  12%  inorganic  compounds,  85%  organic 
compounds  (TRC -Selected  Values) 

Almost  100%  organic,  few  inorganic  (API-TRC  SEU3CTED  SPECTRAL) 

JANAF  -  10% Chemical  elements,  90%  inorganic  compounds,  mixtures  &syst. 

NBS  -  Approx.  3%  chemical  elements,  87%  inorganic,  10%  organic  compounds 

TPRC  -  Not  available 

X-RAY  -  Not  available 

ASTM  -  10%  inorganic  compounds,  mixtures  and  systems,  90%  organic 

compounds,  mixtures  and  systems. 

EPfC  -  Covers  all  chemical  substances  concerned  with  electrical-elec¬ 
tronic  properties. 

AMPIC  -  Covers  ail  chemical  substances  concerned  with  molecular  and 
atomic  processes 

100%  chemical  elements,  mixtures  and  systems. 

1 


I 

1 

I 

I 

I 

I 

1 

I 

I 

I 

I 

I 

I 

I 

I 

I 


CDC  - 
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H,  Property  Coverage  of  Compilations 

1.  What  is  the  intended  property  coverage  for  this  compilation? 

UC  -  Thermodynamics,  properties,  phase  equilibria 

A&M  *  Thermodynamic  properties  (API-Selected  Values) 

Physical  and  thermodynamic  properties  (TRC-Seleeied  Values) 

Spectral  properties  (API-TRC  Selected  Spectral  Data) 

JANAF  -  Heat,  capacity,  entropy,  free  energy  function,  enthalpy,  heat  of 
formation,  equilibrium  constant 

NBS  -  Enthalpy  of  formation,  Gibbs  Energy  of  formation,  Entropy,  Heat 
capacity 

TPRC  -  Conductivity,  viscosity,  thermal  diffusivity,  specific  heat 

X-RAY  -  Not  available 

ASTM  -  Not  available 

EPIC  -  Electronic  Properties 

AMPIC  -  Not  available 

CDC  -  Not  available 


2.  What  is  the  total  number,  to  date,  of  properties  included  in  this  compilations? 

UC  -  20  distinct  properties,  not  including  properties  redundantly  expressed 

A&M  -  Approx.  20  (API  and  TRC  Selected  Values) 

1  each  (API-TRC  Selected  Spectral  Data) 

JANAF  --  2 

NBS  -  12 

TPRC  -  5 

X-RAY  -  2u  to  25,  approximately  for  each  publication 
ASTM  -  UV,  and  Mass  Spectra  =  3 
EPIC  -  56 

AMPIC  -  N/A 


CDC  - 


15  for  thermodynamic  properties  of  cryogenic  fluids. 
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3.  What  is  the  average  number  of  values  per  property -substance  included  in  the 


compilation? 

UC  - 

Elements  -  11  alloys,  increases  with  different  temperature  and 
composition  intervals. 

A&M  - 

15  (for  each  API  &  TRC  Selected  Values! 

1  to  3  (API-TRC  Selected  Spectral  Data) 

JANAF  - 

61 

NBS  - 

1 

TPRC  - 

N/A 

X-RAY  - 

15  to  20  for  each  publication 

EPIC  - 

N/A 

ASTM  - 

N/A 

AM  PIC  - 

N/A 

C  DC  - 

N/A 

Compilation  Formatting 

1.  Is  the  data  file  format  retained  or  is  the  format  converted  to  a  different 
publication  or  dissemination  format? 

UC  - 

format  converted 

A&M  - 

format  modified 

JANAF  - 

format  retained 

NBS  - 

format  converted 

TPRC  - 

format  retained 

x-ray  - 

not  available 

ASTM  - 

not  available 

EPIC  - 

format  retained 

AM  PIC - 

not  available 

CD C  - 

not  available 
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2.  What  is  the  current  volume  of  the  data  compilation? 

UC  -  N/A 

A&M  *  2391  data  sheets,  400,  000  information  units  (API  Selected  Values) 

780  data  sheets,  >28,  000  information  units  (TRC  Selected  Values) 

Infrared  Spectral  Data,  TRC-302  and  API-2650  data  sheets; 
Ultraviolet  Spectral  Data,  TRC-103  and  API-1014  sheets;  Raman 
Spectral  Data,  TRC-32  and  API-456  sheets;  MassSpectral  Data, 
TRC-149  and  API-2191  sheets;  Nuclear  Magnetic  Resonance 
Spectral  Data,  TRC-492  and  API- 583  sheets  (API-TRC  Selected 
Spectral  Data) 

JANAF  -  950  data  sheets 


NBS  ~  1260  pages. 

TPRC  -  3.  322  data  sheets 

X-RAY  -  8,  500  compounds  in  each  inorganic  compilation 

5,  000  compounds  in  each  organic  compilation 

ASTM  -  Approximately  123,300  spectra 

EPIC  -  3,  519  data  sheets 

AMPIC  -  N/A 

CDC  -  N/A 
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List  of  Chemical  Data  Centers 
and 
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List  of  Chemical  Data  Centers  and  Chemical  Data  Compilations 

A.  The  Joint  Committee  on  X-Ray  Powder  Diffraction  Data, 

American  Society  for  Testing  and  Materials 

1916  Race  Street,  Philadelphia,  Pennsylvania 

1.  X~Ray  Powder  Data  File.  Organic,  and  inorganic.  Sets  1-5  in 
Tabular  Format,  Revised.  1960. 

2.  ASTM  X-Ray  Powder  Diffraction  Card  File.  Organic,  and  inor¬ 
ganic.  3x5  card  format,  updated  annually. 

3  ASTM  X-Ray  Powder  Diffraction  Card  File.  Organic,  and  inor¬ 
ganic.  Keysort  format,  updated  annually. 

4.  Matthews  Coordinate  Index  for  Inorganic  Compounds.  Optical 
coincidence,  updated  annually. 

5.  The  Fink  Inorganic  Index  to  the  X-Ray  Powder  Diffraction  File. 
Published  annually. 

6.  KWIK  Index  for  Inorganics.  Published  annually. 

7.  List  oi  411oy-Phased  Designations  of  the  X-Ray  Diffraction  Data 

File.  Inorganic.  Sections  1-12.  1963. 

8.  X-Ray  Powder  Diffraction  Data  File.  Organic,  and  inorganic. 

Sets  1-16,  magnetic  tape.  Updated  annually. 

9.  The  Hanawalt-Davey  Index  to  the  X-Ray  Powder  Diffraction  File. 
Organic,  and  inorganic.  Published  annually. 

10.  The  IBM-Tab  (Hollerith)  Card  System,  An  Index  of  the  ASTM  X-Ray 
Powder  Diffraction  Data  File.  Organic,  and  inorganic.  Updated 
annually. 

B.  American  Society  for  Testing  and  Materials  (ASTM) 

L916  Race  Street,  Philadelphia,  Pennsylvania 

The  Wyandotte -ASTM  Spectral  Data  Project 
Wyandotte  Chemicals  Corporation 
Wyandotte,  Michigan 

1.  Mass  Spectrometry,  Data  and  Name  File.  Hollerith  cards.  1963. 

2.  Mass  Spectrometry,  Data  and  Chemical  Structure  File.  Hollerith 
cards.  1963. 

3,  Molecular  Formula  List  of  Compounds,  Names  and  References  to 
Published  Infrared  Spectra.  Hollerith  cards.  Updated  annually. 

4,  The  Wyandotte -ASTM  Punch  Card  Index  for  Infrared  Spectra. 
Includes  data  on  near  infrared,  0.  7-3.  5  microns,  infrared,  2-16 
microns,  and  far  infrared,  11-36  microns.  Hollerith  cards. 
Updated  annually. 
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5.  The  Wyandotte -ASTM  Punch  Card  Project  for  Ultraviolet  and  Visi¬ 
ble  Absorption  Spectra.  200-420  mil  coding  resolution  for  ultravio¬ 
let,  350-845  mil  coding  resolution  for  visible  spectra.  Hollerith 
card  format.  Updated  annually. 

I  ,  Special  Technical  Publication  358,  Serial  Number  List  of  Com¬ 
pound  Names  and  References  to  Published  Infrared  Spectra. 

ASTM  1963. 

7.  Special  Technical  Publication  356,  The  Index  of  Mass  Spectral 
Data.  ASTM  1963. 

3.  Special  Technical  Publication  357.  Molecular  Formula  List  of 
Compound  Names  and  References  to  Published  Ultraviolet  and  Visi  ¬ 
ble  Spectra.  ASTM  1963. 

9.  Special  Technical  Publication  331.  and  Supplement.  The  Molec¬ 
ular  Formula  List  of  Compound  Names  and  References  to  Published 
Infrared  Spectra.  ASTM  1962. 

10.  Special  Technical  Publication  329,  and  Supplement.  Coder  for 
Periodical  Titles.  ASTM  1964. 

C.  The  University  of  California,  Berkeley,  California 

Thermodynamic  Properties  of  Metals  and  Alloys  Bata  Center 

1.  Selected  Values  of  Thermodynamic  Properties  of  Metals  and  Alloys. 
John  Wiley,  1963. 

2.  Selected  Values  of  Thermodynamic  Properties  of  Metals  and  Alloys. 
Issued  in  loose-leaf  form  as  supplements. 

D.  Texas  A  &  M  University,  College  Station,  Texas 

Thermodynamics  Research  Center  (formerly  MCA)  Projects  on 

Properties  of  Chemical  Compounds 

American  Petroleum  Institute  Project  44 

1.  Selected  Values  of  Properties  of  Hydrocarbons  and  Related  Com¬ 
pounds.  Updated  semi  -annually. 

2.  The  API  Catalog  of  Infrared  Spectral  Data. 

3.  The  API  Catalog  of  Ultraviolet  Spectral  Data. 

4.  The  API  Catalog  of  Raman  Spectral  Data. 

5.  The  API  Catalog  of  Mass  Spectral  Data. 

6.  The  API  Catalog  of  NMR  Data. 
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7.  Selected  Values  of  Properties  of  Chemical  Compounds.  Thermo¬ 
dynamics  Research  Center  (formerly  MCA).  Updated  semi¬ 
annually. 

8.  TRC  Catalog  of  Infrared  Spectral  Data. 

9.  TRC  Catalog  of  Ultraviolet  Spectral  Data. 

10.  TRC  Catalog  of  Raman  Spectral  Data. 

11.  TRC  Catalog  of  Mass  Spectral  Data. 

12.  TRC  Catalog  of  NMR  Spectral  Data. 

E.  National  Bureau  of  Standards,  Gaithersburg,  Maryland 

Selected  Values  of  The  r  mo  dynamic  Properties  of  Inorganic  Sub¬ 
stances  Data  Center 

1.  Selected  Values  of  Thermodynamic  Properties  of  Inorganic  Sub¬ 
stances.  NBS  Circular  500.  Updated  periodically. 

F.  Purdue  University,  Lafayette,  Indiana 

Thermophysical  Properties  Research  Center  (TPRC) 

1.  Thermcphysical  Properties  Research  Center  Data  Book.  Tabular 
format.  Updated  semi-annually. 

2.  Retrieval  Guide  to  Thermophysical  Properties  Research  Literature. 
Tabular  format.  Published  yearly. 

G.  Dow  Chemical  Company,  Midland,  Michigan 

Joint  Army-Navy-Air  Force  (JANAF)  Thermochemical  Tables 

1.  JANAF  Thermochemical  Tables.  In  tabular  format  and  magnetic 
tapes.  Updated  luarterly. 

*H.  Atomic  Energy  Commission 

Oak  Ridge  National  Laboratory,  Oak  Ridge,  Tennessee 

Atomic  and  Molecular  Processes  Information  Center 

1.  Bibliography  on  Reaction  Processes.  Tabular  format.  Published 
semi -yearly. 

*  I .  National  Bureau  of  Standards,  Boulder,  Colorado 

Cryogenic  Data  Center 

1.  Thermodynamic  Properties  of  Helium  at  Low  Temperatures  and 
High  Pressures.  May  1959. 

2.  Thermodynamic  Properties  of  Helium  from  3-300°K,  Between 
.  5  and  100  Atmospheres.  Technical  Note  #154,  1962. 
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3.  Thermodynamic  Properties  of  Helium  from  6-540°  R,  Between 
10  and  1500  PSIA.  Technical  Note  #154a.  January  1962. 

4.  The  Solubility  of  Helium  in  Liquid  Hydrogen.  The  Journal  of 
Chemistry  and  Physics,  Volume  40.  April  1964. 

5.  A  Tabulation  of  the  Thermodynamic  Properties  of  Nor  mal  Hy¬ 
drogen  from  Low  Temperatures  to  300°K.  and  from  1  to  100 
Atmospheres.  Technical  Note  #120.  November  1961. 

6.  A  Tabulation  of  the  The rmo dynamic  Properties  of  Normal  Hy¬ 
drogen  from  Low  Temperatures  to  540°R,  and  from  10  to  1500 
PSIA.  Technical  Note  #  120a.  June  1962. 

7.  A  Compilation  and  Correlation  of  the  P-V-T  Data  of  Normal 
Hydrogen  from  Saturated  Liquid  to  80° K.  Advances  in  Cryo¬ 
genic  Engineering,  Volume  5  (Procedures  of  the  1959  Cryogen¬ 
ic  Engineering  Conference.  Plenum  Press,  Inc.  1960). 

8.  The  Densities  of  Saturated  Liquid  Hydrogen.  Cryogenics  #2, 
volume  2.  December  1961. 

9.  Compilation  of  the  Physical  Equilibria  and  Related  Properties 
of  the  Hydrogen -Helium  System. 

10.  Thermodynamic  Properties  of  Parahydrogen  from  1-22°K. 
Technical  Report  #1. 

11.  Specific  Heat  of  Saturated  Liquid  Parahydrogen  from  15-32°K. 
Cryogenics  #2,  volume  5.  September  1962. 

12.  Vapor  Pressure  of  20°K  Equilibrium  Hydrogen.  Cryogenics  #2, 

volume  4,  1962. 

13.  Specific  Heat  at  Constant  Volume  of  Parahydrogen  at  Tempera¬ 
tures  from  15~90°K  and  Pressures  to  340  ATM.  Cryogenics  #2, 
volume  6.  December  1962. 

14.  Pressure-Density  Temperature  Relations  of  Fluid  Parahydrogen 
from  15-100°K  at  Pressures  to  350  Atmospheres.  Technical 
Note  #^7a.  Number  2.  April  1963. 

15.  The  Orihobaric  Densities  of  Parahydrogen,  Derived  Heats  of 
Vaporization  and  Critical  Constants.  Cryogenics  #2.  March  1963. 

16.  Thermodynamic  and  Related  Properties  of  Parahydrogen  from 
the  Triple  Point  to  100°K  at  Pressures  to  340  Atmospheres. 
Monograph  #94.  August  1965. 

17.  Refractive  Index  Dispersion  of  Liquid  Hydrogen.  Technical  Note 
#323.  September  1965. 
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18.  Low  Temperature  Transfer  Properties  of  Commercial  Metals 
and  Alloys,  Journal  of  Applied  Physics,  Volume  31,  #3. 

March  1960. 

19.  Low  Temperature  Transfer  Properties  of  Copper  and  Its  Dilute 
Alloys,  Pure  Copper,  Annealed  and  Cold-drawn.  Physics  Re¬ 
view,  #115,  #3.  July  1959. 

20.  Low  Temperature  Transport  Properties  of  Commercial  Metals 
and  Alloys.  IV  Reactor  Grade  BE,  MO  and  W.  Journal  of  Ap¬ 
plied  Physics,  volume  31,  #7.  July  I960. 

21.  Mechanical  Properties  of  Structural  Materials  at  Low  Tempera¬ 
tures:  a  Compilation  from  the  Literature.  Monograph  #13, 

June  1960. 

22.  Low  Temperature  Transport  Properties  of  Commercial  Metals 
and  Alloys.  HI  Gold-Cobalt.  Journal  of  Applied  Physics,  vol¬ 
ume  31.  March  I960. 

23.  Thermal  Expansion  of  Technical  Solids  at  Low  Temperatures; 

a  Compilation  from  the  Literature.  Monograph  #29.  May  1961. 

24.  Specific  Heats  and  Enthalpies  of  Technical  Solids  at  Low  Tempera 
tures.  Monograph  #21.  October  1960. 

*  J  .  Hughes  Aircraft  Company,  Culver  City,  California 

Electrical-Electronic  Properties  Information  Center  (EPIC) 

1.  Cadmium  Oxide  (DS149,  M.  Neuberger,  June  1966). 

2.  Bismuth  Telluride -Bismuth  Selenide  Systems  (DS147,  M.  Neu¬ 
berger,  January  1966). 

3.  Zinc  Sulfide  (DS135,  D.  B.  Carter,  November  1966). 

State -of-the -Art  Reports: 

4.  Polimide  Plastics,  A  State -of-the -Art  Report.  (S8,  John  Milek, 
October  1965) 

5.  Niobium  Alloys  and  Compounds  (DS148,  Donald  L.  Grisby, 
January  28,  1966). 


*  Those  Chemical  Data  Compilations  in  monograph  form  were  analyzed 
qualitatively. 
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Chemical  Thermodynamic  Data  Group,  Thermochemistry  Section,  Physical  Chem¬ 
istry  Division,,  Institute  of  Basic  Standards,  National  Bureau  of  Standards  (Circu¬ 
lar  500). 

The  National  Bureau  of  Standards  Thermochemistry  Section  collects,  correlates 
and  evaluates  data  on  chemical  thermodynamic  properties  of  all  inorganic  sub- 
stances  and  organic  substances  having  molecules  containing  not  more  than  two 
carbon  atoms.  The  tables  contain  values,  where  known,  of  the  enthalpy  and 
Gibbs  energy  of  formation,  enthalpy,  entropy  and  heat  capacity  at  298.  15°K 
(25°C)  and  the  enthalpy  of  formation  at  Q°K.  The  Standard  Order  of  Arrange¬ 
ment  (the  principle  of  latest  position)  is  used  to  list  the  compounds,  so  that  in 
a  given  element  table,  all  the  compounds  of  that  elements  will  be  grouped  to¬ 
gether. 

Their  reference  source  file  consists  primarily  of  technical  reports  published  in 
journals,  periodicals  and  government  reports.  They  make  technical  evaluations 
regarding  the  accuracy,  quality  and  significance  of  information  that  has  been  in¬ 
troduced  into  the  system.  They  prepare  state-of-the-art  reviews,  correlate  in- 
formation  and  data,  and  prepare  information  chi  selected  classes  of  compounds 
n,v  publication.  All  data  and  values  are  critically  evaluated  and  correlated  to 
obtain  the  best  values. 

The  Center  also  provides  answers  to  technical  questions  and  inquiries.  These 
references  and  their  principal  measured  values  are  extracted  on  to  3  x  5  cards 
and  entered  into  a  large  file  which,  within  our  system  concept,  combines  the 
data  source  file  index  and  the  data  file.  Upon  the  decision  to  compile  or  up¬ 
date  values  for  a  substance -property  field,  relevant  references  and  data  can 
then  be  retrieved  from  the  file,  which  is  organized  on  the  basis  of  substance 
and  property  coverage. 
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Circular  500 

Chemical  Thermodynamic  Data  Group 
Thermochemistry  Section,  Physical  Chemistry  Division 
Institute  of  Basic  Standards 
National  Bureau  of  Standards 

Physical  A r rang  ■  y  nt  of  Data  Source  Files  (Library? 

The  source  document  t'iioi  and  indices  and  their  construction  are  indica¬ 
ted  with  the  proper  labe  and  recording  media.  The  following  code  is  used 
to  indicate  each  element  included  in  the  file  and  its  function  as  a  mode  of 
access  for  searching:  P  *  Primary  access  mode;  S  =  Secondary  access 
mode;  N  =Not  used  as  an  access  mode. 


Afar- 

tactic  C*$nc  Selected  Cbrrent 

Dsoxrat 

Rdfenaxe 

Data 

Data 

Data 

Suras  Work 

File 

File 

File 

File 

File 

File 

Files 

Recording  Media 

5x8  mi¬ 

3x5 

3x5 

3x5 

3x5 

* 

3x5 

8  l/2x  11 

crocopy 

cards 

cards 

cards 

cards  cards  sheets,  cal  - 

+  tefcufar 
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Joint  Army-Navy- Air  Force  (JAJNAF)  Thermochemical  Tables,  Dow  Chemical 
Company,  Midland,  Michigan, 

The  preparation  of  the  JANAF  Thermochernieal  Tables  was  initiated  as  part  of 
a  large  experimental  program  carried  on  at  several  laboratories .  In  the 
early  phases,  these  laboratories,  under  the  coordination  of  the  JANAF  Thermo- 
chemical  Panel,  contributed  data  which  were  issued  without  critical  evaluation. 
Now  most  new  data  is  screened  from  the  open  literature,  principally  Chemical 
Abstracts,  and  is  all  critically  evaluated  prior  to  dissemination. 

Substance  coverage  of  this  data  compilation  activity  is  confined  to  the  light 
elements  (first  two  periods  of  the  periodic  table)  plus  some  selected  heavy 
elements.  Approximately  one-fourth  of  the  elements  in  the  periodic  table  are 
covered.  Properties  coverage  includes  standard  thermodynamic  functions  as 
a  function  of  temperature  (e.g. ,  heat  capacity,  entropy,  Gibbs  free  energy 
and  heat  of  formation). 

The  principal  users  of  the  compilation  are  specialists  who  calculate  perfor¬ 
mance  of  rocket  fuels. 

The  total  number  of  data  sheets  in  the  compilation  is  950.  Approximately 
200  data  sheets  are  issued  annually,  fifty  each  quarter.  Of  the  200  issued 
annually,  100  are  data  sheets  for  chemical  substances  not  previously  covered, 
and  100  are  revisions  of  data  sheets  previously  issued.  The  activity  main¬ 
tains  a  data  source  index  file,  a  data  source  file  and  a  data  file.  The  distri¬ 
bution  of  effort  among  the  operations  performed  by  the  activity  is  concen¬ 
trated  in  the  data  evaluation  operation  (approximately  70%  of  the  total). 
Automated  methods  are  used  both  to  screen  the  literature  for  location  of 
source  documents  and  to  compute  the  tables  of  thermodynamic  functions. 

The  use  of  ADP  methods  has  recently,  on  a  trial  basis,  been  extended  to 
printing  of  the  tables  of  functions. 
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The  source  document  files  and  indices  and  their  construction  are  indica- 
ted  with  the  proper  label  and  recording  media.  The  following  coue  xs 
used  to  indicate  each  element  included  in  the  file  and  its  function  as  a 
mode  of  access  for  searching:  P  =  Primary  access  mode;  S  -  Secondary 
access  mode;  N  =  Not  used  as  an  access  mode. 
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Thermodynamic  Properties  of  Metala  and  Alloys  Data  Center,  University  of 
California,  Berkeley,  California. 

The  Thermodynamic  Properties  Data  Center  compiles  data  on  all  substances 
that  conduct  electricity  electronically,  which  includes  metals  and  alloys .  They 
evaluate  all  reported  values  to  produce  critical  "beet  estimate*1  thermodynamic 
values  for  all  standard  thermodynamic  functions,  thermal  functions,  vapor 
pressures,  activity  coefficients  and  partial  molar  quantities. 

To  support  this  critical  evaluation,  the  literature  is  screened  through  direct 
journal  scanning  and  abstract  journal  screening.  Relevant  sources  {coverage 
from  1910  to  date)  are  indexed  and  entered  in  the  project's  optical  coincidence 
retrieval  index,  to  be  retrieved  and  evaluated  upon  the  decision  to  compile  or 
update  the  values  for  an  element  or  alloy.  Selected  values  were  published  in 
book  form  in  1963  as  Selected  Values  of  Thermodynamic  Properties  of  Metals 
and  Alloys  and  are  now  updated  and  expanded  by  frequent  loose-leaf  supplements. 
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Thermodynamic  Properties  of  Metals  and  Alloys  Data  Center 

University  of  California 
Berkeley,  California 


Physical  Arrangement  of  Data  Source  Files  (Library) 

The  source  document  files  and  indices  and  their  construction  are  indica¬ 
ted  with  the  proper  label  and  recording  media.  The  following  code  is 
used  to  indicate  each  element  included  in  the  file  and  its  function  as  a 
mode  of  access  for  searching:  P  =  Primary  access  mode;  S  *  Secondary 
access  mode;  N  =  Not  used  as  an  access  mode. 
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Cryogemc  Data  Center,  Cryogenic  Engineering  Laboratory,  National  Bureau  of 
Standards,  Boulder,  Colorado, 

The  Cryogenic  Data  Center  is  engaged  in  the  critical  evaluation  and  compilation 
of  data  on  thermodynamic,  transport  and  other  the r mophye icai  properties  for 
the  principal  cryogenic  fluids  and  common  mixtu  res  of  these  fluids  used  at  low 
temperatures.  The  scope  of  this  compilation  activity  also  includes  the  proper¬ 
ties  of  metallic  elements,  selected  alloys,  and  element  dielectrics.  The  ther¬ 
modynamic  properties  of  fluids  include  pressure-volume-temperature,  vapor 
pressure,  latent  heat,  saturation  densities,  isothermal  compressibility,  vol¬ 
ume  expansivity,  entropy,  enthalpy,  internal  energy,  specific  heat  and  velocity 
of  sound.  The  transport  properties  of  fluids  include  thermal  conductivity, 
Prandtl  number,  thermal  diffusion,  viscosity,  diffusion  and  coefficients,  while 
other  thermophysical  properties  include  dielectric  constant,  electrical  resisti¬ 
vity,  magnetic  properties,  refractive  index,  surface  tension,  optical  proper¬ 
ties  and  dielectric  breakdown. 

This  group  monitors  the  literature  continually  for  the  above  program.  As 
specific  tasks  are  undertaken,  comprehensive  bibliographies  are  prepared  from 
their  magnetic  tape  subject  index  of  relevant  source  documents.  These  bibliog¬ 
raphies  arts  sometimes  published.  Task  notebooks  are  made  for  the  preliminary 
selection  and  evaluation  of  data,  and,  where  feasible,  preliminary  memoranda 
sheets  listing  the  better  data  available  are  issued.  Their  final  evaluations 
yield  critical  review  monographs,  presenting  tables  of  "best  estimate"  values. 

All  cataloging  and  coding  within  the  data  source  file  is  converted  to  machine- 
readable  form  for  automated  processing.  The  programs  used  are  for  searching, 
dictionary  term  identification,  catalog  tape  output,  tape  updating,  etc.  They 
issue  a  weekly  Current  Awareness  service,  custom  bibliographies,  preliminary 
data  and  advice  on  ihe  thermodynamic  and  transport  properties  of  cryogenic 
fluids.  Further,  they  publish  cryogenic  laboratory  publications  and  reports  in 
tabular  form. 
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Cryogenic  Data  Center 

Cryogenic  Engineering  Laboratory 
National  Bureau  of  Standards 
Boulder,  Colorado 

Physical  Arrangement  of  Data  Source  Files  (Library) 

The  source  document  files  and  indices  and  their  construction  are  indica¬ 
ted  with  the  proper  label  and  recording  media.  The  following  code  is 
used  to  indicate  each  element  included  in  the  file,  and  its  function  as  a 
mode  of  access  for  searching:  P  *  Primary  access  mode;  S  -  Secondary 
access  mode;  N  *Not  used  as  an  access  mode. 
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Thermophysical  Properties  Research  Center,,  Purdue  University  Research  Park, 
Lafayette,  Indiana~{TPRC) 

The  principal  output  of  TPRC  is  the  TPRC  Data  Book  on  the  Thermophysical  Prop 
erties  of  Materials,  The  Data  Book  presents  experimental  values  from  the  open 
literature  on  thermal  conductivity,  viscosity,  radiative  properties,  thermal  dif- 
fusivity  and  specific  heat  of  materials  as  comparative  curves  on  a  master  plot, 
TPRC  develops  a  curve  of  recommended  values  for  the  property  and  the  material 
covered  over  a  wide  temperature  range,  from  the  plot,  the  sample  specifications 
given  and  the  data  points  rendered.  This  work,  supported  principally  by  the  Air 
Force,  with  NSRDS  support  on  selected  chemically  pure  substances,  is  made 
available  both  in  the  Data  Book  and  in  NSRDS  reference  monographs.  All  rele¬ 
vant  information  developed  in  other  data  projects  undertaken  by  TPRC  for  speci¬ 
fic  purposes,  such  as  their  current  revision  of  the  Air  Force's  Handbook  on 
Thermophysical  Properties  of  High  Temperature  Solid  Materials,  is  incorpora¬ 
ted  into  the  Data  Book.  ~  ~ 

The  Documentation  Division  of  TPRC  supports  this  effort  with  nearly  34, 000  in¬ 
dexed  references,  completely  covering  the  literature  on  thermal  conductivity, 
specific  heat,  viscosity,  emissivity,  mass  diffusivity,  thermal  diffusivity  and 
Prandtl  number  from  1920  to  1965,  This  nearly -100%  coverage  will  be  made 
available  to  the  public  in  the  forthcoming  Volume  HI  of  the  "Retrieval  Guide  to 
Thermophysical  Properties  Research  Literature",  which,  in  essence,  will  give 
to  its  holder  access  to  TPRCs  entire  literature  collection  at  the  time  of  prin¬ 
ting. 

The  Data  Table  and  Documentation  Divisions  constitute  support  for  TPRC's 
long-range  goal  of  documenting  and  analyzing  investigated  areas  on  the  ther¬ 
mophysical  properties  of  materials,  and  then  proceeding  to  experimental  and 
theoretical  determination  of  values  for  the  gaps  in  knowledge  which  have  been 
uncovered. 
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Physical  Arrangement  of  Data  Source  Files  (Library) 

The  source  document  files  and  indices  and  their  construction  are  indica¬ 
ted  with  the  proper  label  and  recording  media.  The  following  code  is 
used  to  indicate  each  element  included  in  the  file  and  its  function  as  a 
mode  of  access  for  searching:  P  »  Primary  access  mode;  S  «  Secondary 
access  mode;  N  *  Not  used  as  an  access  mode. 
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Thermodynamics  Research  Center,  Texas  A  &  M  University,  College  Station, 
Texas  (TRC). 

TRC  is  the  housing  organization  for  the  generation  and  publication  of  a  multi¬ 
plicity  of  chemical  data  compilations.  For  the  American  Petroleum  Institute, 
TRC  directs  API  Project  44,  to  gather,  produce  internally,  evaluate  and  dis¬ 
seminate  from  the  open  literature  the  "best  estimate"  values  for  thermody¬ 
namic  and  thermochemical  properties  of  selected  hydrocarbons  and  for  rela¬ 
ted  compounds  in  cumulative,  internally  consistent  tables.  They  also  prepare 
and  distribute  catalogs  of  Ultraviolet,  Infrared,  Nuclear  Magnetic  Resonance, 
Raman  and  Mass  Spectral  data  of  hydrocarbons  and  related  compounds, 

TRC,  with  support  from  NSRDS  and  Texas  A  &  M,  compiles  the  "Selected 
Values  of  Properties  of  Chemical  Compounds"  (iormerly  supported  by  the 
Manufacturing  Chemists'  Association)  on  substances  other  than  those  under 
the  scope  of  API  but  of  interest  to  the  chemical  industry.  These  cumulative 
tables  are  presented  in  the  same  format  as  those  of  API  Project  44.  Like¬ 
wise,  for  substances  outside  the  scope  of  API  Project  44,  they  distribute  cata¬ 
logs  of  Ultraviolet,  Infrared,  Nuclear  Magnetic  Resonance,  Raman  and  Mass 
Spectral  data. 

Original  spectra  for  all  the  spectral  data  catalogs  are  furnished  by  contribu¬ 
ting  laboratories.  Supporting  the  compilation  of  selected  values  is  a  massive 
3x5  card  file  which,  within  our  system  concept,  combines  the  data  source 
file  index  and  data  file,  and  which  is  organized  on  the  basis  of  substance  and 
property  coverage.  References  and  measured  values  gleaned  from  a  thor¬ 
ough  search  of  the  open  literature  are  entered  on  cards  and  merged  into  the 
file.  The  relevant  references  and  data  can  then  be  retrieved  upon  the  deci¬ 
sion  to  compile  values  for  a  substance -property  field.  In-house  experimental 
and  theoretical  research  is  also  conducted  for  the  determination  of  needed 
values  and  the  evaluation  of  experimental  values,  as  well  as  the  calculation  of 
"best  estimate"  values  for  publication  in  the  tables  of  selected  values. 

In  addition  to  these  data  compilation  efforts,  TRC  annually  contributes  a  bib¬ 
liography  of  the  articles  on  organic  substances  which  have  appeared  in  the 
open  literature  of  the  previous  year  to  the  International  Union  of  Pure  and  Ap¬ 
plied  Chemistry.  For  access  to  this  bibliography,  a  detailed  substance - 
property  index  is  prepared. 


toltflo*  Communication 

Washington,  O.  C.  200  07 

C-478  Final  Report  -C-xxii 

20  March  1967 


Them,  ct;  .»r  ^ica  Research  Center 

Texas  A  &  M  University 
College  Station,  Texas 


Physical  Arrangement  of  Data  Source  Files  (Librai 


The  source  document  files  and  indices  and  their  construction  are  indica 
ted  with  the  proper  label  and  recording  media.  The  following  code  is 
used  to  indicate  each  element  included  in  the  file,  and  its  function  as  a 
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access  mode;  N  3  Not  used  as  an  access  mode. 
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3x5  9trds 

File  Element 

Source  Document 

(for  IUFAC) 

Document  Abstracts  (extracts) 

N 

N 

Personal  Author 

N 

P  (1st  author) 

Journal  Source 

N 

N 

Type  of  Source  Document 

N 

N 

Date  of  Publication 

Accession  Number 

N* 

N 

Substance  Identified 

p** 

N 

Property  Identified 

S 

N 

Type  of  Measurement 

(substance  purity,  et  al.) 

N 

N 

Compound  Identified 

S 

N 

*  except  cards  filed  chronologically  under  "Conpound  Identified" 

**  by  major  classes,  e.g. ,  alcohols,  et  al. 
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The  Wyandotte  -  ASTM  Spectral  Data  Project,  Wyandotte  Chemicals  Corporation, 
Wyandotte,  Michigan  (sponsored  by  the  American  Society  for  Testing  and  Ma¬ 
terials  (ASTM1  Philadelphia,  Pennsylvania). 

ASTM  prepares  and  distributes  the  Wyandotte -ASTM  punched  card  index  for 
searching  and  correlating  Infrared,  Ultraviolet  and  Visible  Spectral  Absorp¬ 
tion  data  and  chemical  structure.  The  system,  developed  at  Wyandotte  Chemi¬ 
cals  Corporation  by  Dr.  L.  E.  Ruentzel  and  subsequently  adopted  by  ASTM 
Technical  Committee E- 13  on  Absorption  Spectroscopy,  makes  use  of  Hollerith 
cards  and  sorters,  and  magnetic  tape  and  computers  with  many  different  types 
of  output,  including  optical  coincidence,  tab  card,  computer  tapes  and  tabu¬ 
lar  publications. 

The  system  was  designed  to  facilitate  the  searching  of  spectral  absorption 
data  for  the  purpose  of  matching  spectrograms  in  qualitative  analysis  and  for 
correlating  chemical  structure  and  absorption  band  positions.  A  standardized 
coding  system  was  developed  for  automated  manipulation  of  the  data.  The  data 
file  includes  the  spectral  position  of  all  important  bands  of  a  published  spectro¬ 
gram,  the  structural  details  of  the  chemical  compound  involved,  a  semi- 
molecular  formula,  and  melting  and/or  boiling  point  and  a  reference  to  the  lo¬ 
cation  of  the  spectrogram  in  the  literature.  They  attempt  to  include  all  pub¬ 
lished  standard  spectra  including  those  sold  by  commercial  laboratories. 


Wyandotte-ASTM  Spectral  Data  Project 
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Joint  Committee  op  C33.eoai.cal  Analysis  by  Powder  Diffraction  Methods,  Ameri 
can  Society  for  Testing  and  Materials,  Philadelphia,  Pennsylvania. 

The  Joint  Committee  is  sponsored  by  four  international  cooperating  societies: 
The  Institute  of  Physics  (British),  The  American  Crystallographic  Society, 
The  National  Association  of  Corrosion  Engineers,  and  The  American  Society 
for  Testing  and  Materials. 

The  Joint  Committee  has  two  functions:  first,  to  collect  diffraction  data  from 
all  available  sources,  critically  edit  this  material,  publish  the  selected  data 
in  a  convenient  form,  and  provide  suitable  indexes;  secondly,  to  advance  the 
techniques  by  which  this  diffraction  data  can  be  used  for  chemical  identifica¬ 
tion.  Data  are  continuously  collected  for  the  file  in  three  ways:  (1)  abstrac¬ 
ted  from  the  literature;  (2)  sponsored  projects  of  the  Joint  Committee;  and 
(3)  patterns  sent  to  the  Joint  Committee  by  individual  scientists  throughout 
the  world. 

The  Joint  Committee  sponsors  the  production  of  patterns  by  several  groups  in 
the  United  States,  Great  Britain,  The  Netherlands,  Israel  and  Japan,  which 
has  resulted  in  the  rapid  increase  of  high-quality  patterns,  the  main  factor  in 
growth  of  the  file. 

Diffraction  analysis  identifies  a  substance  by  means  of  atomic  arrangement. 

It  is  used  whenever  it  is  necessary  to  identify  the  state  or  combination  of  the 
chemical  elements  or  phases  present.  The  X-ray  diffraction  cards  show 
numerical  values  for  the  powder  pattern  lines,  with  intensities,  value  con¬ 
stants,  space  group  and  other  crystallographic  data,  when  available.  These 
patterns  are  presented  on  three  different  types  of  output:  plain  cards  (3  x  5), 
Key  sort  cards  (4  x  6),  and  magnetic  tapes.  They  also  have  indexes  that  in¬ 
clude  tabular,  books,  IBM  cards,  KW1C  indexes,  magnetic  tapes  and  optical 
coincidence. 
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Electrical-Electronic  Properties  Information  Center,  Hughes  Aircraft  Company, 
Culver  City,  California  (EPIC). 

EPIC  searches  the  open  literature  for  all  articles  dealing  with  the  electronic 
properties  of  materials.  When  located,  such  literature  is  indexed  by  sub¬ 
stance  and  by  one  or  more  of  56  electronic  properties.  From  this  data 
source  file  and  index,  a  bibliography  can  be  produced  that  is  extensively  sub¬ 
ject-indexed.  Their  data  compilation  efforts  are  supported  by  this  data  source 
file.  EPIC’S  principal  data  output  is  in  the  form  of  data  sheets.  After  a  mate¬ 
rial  or  group  of  materials  is  selected  for  compilation  coverage,  an  assigned 
specialist  reviews  all  relevant  documents,  evaluates  them  for  reliability,  and 
then  extracts,  for  re -publication  in  a  concise  format,  the  relevant  data  con¬ 
tent  of  all  reliable  articles  on  the  substance  and  property  chosen.  At  the  dis¬ 
cretion  of  the  assigned  specialist,  findings  are  presented  graphically  or  in  a 
tabular  format  (one  reference  per  page  or  a  comparison  of  references  on  one 
page).  Bound  data  sheets  (in  our  terminology,  monographs)  are  then  issued, 
giving  the  electronic  properties  of  a  substance  or  a  group  of  substances.  State- 
of-the-art  reports  are  also  prepared  by  EPIC,  presenting  experimental  data  on 
all  properties  of  selected  substances.  These  reports  require  searches  beyond 
the  immediate  EPIC  data  source  files  and  indexing. 

EPIC  also  produces  and  distributes  large  computer -gene rated  bibliographies. 
These  may  be  as  by-products  of  a  data  compilation  task,  or  for  service  as 
Interim  Reports  while  final  data  compilation  is  under  way.  Such  Interim  Re¬ 
ports  may  have  some  preliminary  curves  drawn.  Specific  requests  for  data 
and/or  references  are  also  answered.  Large  bibliographies  generated  in 
response  tn  such  requests  are  sometimes  generally  distributed. 
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Electrical -Electronic  Properties  Information  Center 

Hughes  Aircraft  Company 
Culver  City,  California 

Physical  Arrangement  of  Data  Source  Files  (Library) 

The  source  document  files  and  indices  and  their  construction  are  indica¬ 
ted  with  the  proper  label  and  recording  media.  The  following  code  is 
used  to  indicate  each  element  included  in  the  file  and  its  function  as  a 
mode  of  access  for  searching:  P  =  Primary  access  mode;  S  “Secondary 
access  mode;  N  *Not  used  as  an  access  mode. 
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Atomic  and  Molecular  Processes  Information  Center,  Oak  Ridge  National  Labors- 
tory,  Oak  Ridge,  Tennessee. 

The  Atomic  and  Molecular  Processes  Information  Center  evaluates,  reviews  and 
retrieves  data  on  atomic  and  molecular  processes.  Scientific  interest  in  physi¬ 
cal  and  chemical  processes  on  an  atomic  scale  has  been  expanded  from  the 
study  of  laboratory  gas  discharges  to  include  potential  thermonuclear  plasmas, 
r atrophy s leal  systems  and  magneto -hydrodynamic  phenomena,  as  well  as  nu¬ 
merous  gas  and  plasma  systems  not  in  thermodynamic  equilibrium.  Collisions 
of  a  binary  and,  less  frequently,  of  a  ternary  nature  dominate  the  plasma  state 
in  the  fourth  state  of  matter. 

Major  areas  of  interest  are  the  interaction  of  heavy  particles  with  other  heavy 
particles,  electron  and  heavy  particle  interactions  with  macroscopic  matter, 
and  the  effects  of  static  or  quasi-static  electric  and  magnetic  fields  on  heavy 
particles,  especially  those  in  excited  states. 

The  Center  produces  bibliographies  in  these  areas  on  a  yearly  basis  and  is 
also  preparing  critical  reviews  of  published  and  unpublished  data  in  the  fields 
of  ion -atom  rearrangement  collisions  and  of  ionization,  excitation  and  dis¬ 
sociation  by  heavy  particles.  In  addition,  the  Center  answers  inquiries  and 
serves  as  a  clearinghouse  for  technical  information  in  its  subject  field. 
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Atomic  and  Molecular  Processes  Information  Center 

Oak  Ridge  National  Laboratory 
Oak  Ridge,  Tennessee 

Physical  Arrangement  of  Data  Source  Files  (Library) 

The  source  document  files  and  indices  and  their  construction  are  indica- 
ted  with  the  proper  label  and  recording  media.  The  following  code  is 
used  to  indicate  each  element  included  in  the  file,  and  its  function  as  a 
mode  of  access  for  searching:  P  *  Primary  access  mode;  S  *  Secondary 
access  mode;  N  *  Not  used  as  an  access  mode. 
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WORKSHEET  FOR  ANALYSIS  OF  DATA  COMPILATION  ACTIVITIES 


I.  OPERATIONAL  CONTEXT 


A.  Housing  Organization 

1 .  Name  Thermal  Laboratory,  Dow  Chemical  Company.  _ _ 

_ Midland,  Michigan  (JANAF) _ 

2.  Type  of  institution: 

<  )  a.  Governmental  <  )  d.  Professional  Society 

(x)b.  Industrial  (  )  e.  Other  (identify) 

{  )  c.  Educational  _ 

3.  Primary  motivation  for  data  compilation  activities: 

(  )  a.  In-house  support  servicing 

(  x)  b.  External  servicing,  non-profit  *( contractor  operated) 

(  )  c.  External  servicing,  profit  making 

(  )  d.  Other  (identify) _ 

4.  Source  of  funding  for  data  compilation  activities: 

_ %  a  Industry  _ %  d-  Local  and  State 

_ %  b.  Professional  Societies  _ %  e.  Other  (identity) _ 

100  %  c.  Federal  Government  _ 

5.  Percentage  distribution  of  current  data  compilation  efforts  among  the  following 
processing  functions,  e.  g.  , 

a.  _ 3  % 

b.  5  % 

c.  2  it 

d.  65  % 

e.  _ 5  % 

f.  10  % 

g.  10  % 

h.  _ 0> 

i-  0  % 


6.  Identification  of  data  compilations  prepared  by  housing  organization: 
JANAF  Thermochemical  Tables _ _ _ 


for  acquisition  of  data  sources 

for  source  document  screening  and  evaluation 

for  source  document  file  (library)  maintenance 


for  data  screening  and  evaluation 
for  data  extraction 
for  data  file  construction  and  maintenance 


ran  integral  reiterative 
process 


for  compilation  publication  and  dissemination 


for  inquiry  handling 

for  other  processing  function(s)  (Specify) 
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DATA  SOURCE  (LIBRARY)  COLLECTION  AND  PROCESSING 


A.  Identification  and  Acquisition  of  Sources  at  Data 

1.  Which  of  the  following  are  screened  in  searching  for  candidate  document  titles 
or  data  sources? 

(x)a.  Journal  articles.  Approx.  %  5 

<  )  b.  Catalog  cards.  Approx.  % _ . 

(  )  c  .  Bibliographic  citations  in  indexed  documents.  Approx.  % 

(x)  d.  Index  and  abstract  publications,  Approx.  %  5  ^Chemical  Abstracts 

<  x)  e.  Computerized  tape  indexes.  Approx.  %  85  *  Chemical  Titles 

(  )  f.  Punched  card  indexes.  Approx.  % 

( x)  g.  Others  (Identify)  Research  Reports  ,  Approx.  %  S 

2.  Total  number  of  candidate  sources  identified  annually  1,  500 _ . 

3.  Which  of  the  following  data  sources  are  annually  indexed? 

(x)  a.  Journal  articles.  Approx.  %  90 

(x)  b.  Published  reports  other  than  journal  articles.  Approx.  %  10  -*Govt.  research 
{  )  c.  Unpublished  laboratory  reports.  Approx.  %  reports 

(  )  d.  Data  sheets  from  other  compilations.  Approx  % 

(  )  e.  Customized  data  sheets.  Approx.  % _ . 

(  )  f .  Others  (Identify)  ,  Approx.  % _ . 

4.  What  is  the  form  of  data  sources  currently  accessioned? 

(x)  a.  Hardcopy,  Approx.  %  100  (  )  e.  ^Magnetic  tapes.  Approx.  % _ . 

(  )  b.  Microfilm,  Approx.  %_ _ .  <  )f.  Others  (identify) 

(  )  c.  Microfiche,  Approx.  % _ .  ,  App.  % _ 

(  )  d.  Punched  cards.  Approx.  % _ .  _ ,  App.  % _ . 

♦Planning  conversion  to  microcards  .  ^ 

5.  What  is  the  total  number  of  data  sources  indexed  to  date?  26,  000 _ 

6.  What  is  the  period  covered  by  data  sources  indexed  to  date?  1925-66 _ 

7.  What  percentage  of  the  total,  usable  data  sources  issued  during  the  period  of 

coverage  has  been  indexed?  _ >90  % 

8.  What  percentage  of  currently  issued,  usable  data  sources  is  indexed?  ^0% 

9.  What  percentage  of  current  year  accessions  were  issued  or  published  within 

the  past  year?  _ 90  %  *in  abstract  bulletins;  50%  in  original  journal. 


1.  Data  Source  Screening  and  Evaluation 

1.  What  are  principal  criteria  for  screening  of  titles?  The  document  must 
report  measurement  of  property  of  interest  to  Thermal  Laboratory:  no 
substance  screening  is  performed.  Screening  is  done  automatically  bv 
use  of  weighted  key  words. 
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2.  What  percentage  of  initially  considered  titles  is  rejected  without  physical 
examination  of  the  data  source?  >95% 

3.  What  are  the  principal  criteria  for  screening  of  data  sources  ?  Does  the 

candidate  source  contain  new  measurements.  No  substance  screening 
is  performed:  remeasurement  of  a  property  value  is  accepted. _ 


4.  What  percentage  of  data  sources  physically  examined  is  rejected? _ 50% 


C.  Data  Source  Cataloging  and  Indexing 


1. 


Indicate  the  extent  to  which  the  following  modes  of  cataloging  and  indexing 
are  used. 


a.  b.  c.  d. 


Means  of  Indexing 
or  Cataloging 

Total  #  of 
such  des¬ 
criptors 
used  to  date 

Current  sum. 
rate  of  addi¬ 
tion  of  new 
descriptors. 

Av.  #  of  des¬ 
criptors  as¬ 
signed  to  each 
source  doc. 

Cur.  annual 
growth  of 
descriptor 
entries 

Substance  Ident. 

Property  Ident. 

Type  of  Measrmt. 

Others  (identify) 

Substance  Charac. 

IRRIRVil 

apprx.  1, 000 

70 

18 

9 

Total  of  5 

7,  500  entries 

2.  Indicate  which  means  are  used  to  index  the  substance  coverage  of  data  sources 
(  )  a.  Trade  name 

(  )  b.  Trivial  name  of  compounds,  complexes,  systems,  etc. 

(x)  c.  C.  A.  or  other  systematic  chemical  nomenclature 
(  )  d.  Chemical  fragment  codes. 

(  )  e.  Others  (identify) _ 
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3.  What  nomenclature  standards,  thesauri,  code  manuals,  etc.  are  used  to 
control  indexing  of  substance  coverage?  Modified  Hill  Indexing  System. 
Jrl.  Americ 


4.  What  means  are  used  to  index  the  property  coverage  of  data  sources? 


—  11  '  "w1 1 1"  sg ||-' I V a Ti'iijBi 


or  data  comoilation  techniques  and 


the  state  or  condition  of  the  substance  ore  indexed  by  memonic  3-  or 


4 -letter  codes. 


D.  Physical  Arrangement  of  Data  Source  Files  (Ldbr 


1.  To  indicate  how  the  source  document  file  (library)  and  indices  are  constructed, 
label  each  file  and  give  its  recording  media  (e,  g. ,  hardcopy,  microfilm,  punched 
cards,  etc.  ).  Then  use  the  following  code  to  indicate  for  each  element  included 
in  the  file  its  function  as  a  mode  of  access  for  searching:  P  *  Primary  access 

mode,  S  *  Secondary  access  mode:  N  *  Not  used  as  an  access  mode. 

1925-62  1962-66  Property-  Data  Basic 

eg  Doc,  ateignffg  Source  .  -  - 

File  File  File.  TBS  File 


Recording  Media 

File  Element 

Source  Doc. 

Doc.  Abstracts 

Personal  Auth. 

Journal  Source 

ource 

Date  of  public1  n. 

Accession  Number 

Substance  Ident. 

Property  Ident.*-*^,  .  . 

(comb. ) 

Type  of  measrmt^ 
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B.  Purging  of  Data  Source  File  (Library) 

1.  Indicate  the  method  of  purging  data  source  file  (library). 

(x )  a.  File  is  not  purged 

l  }  b.  Source  documents  of  &  specified  age  (e,  g. ,  10  years)  are  purged 
<  )  c.  Source  documents  are  purged  when  more  reliable  sources  become 
available. 

(  )  d.  Source  documents  are  purged  when  their  pertinent  data  is  incorporated 
into  the  data  file. 

2.  How  many  data  sources  are  annually  purged  from  the  file?  none _ 

F.  If  this  data  source  file  (library)  supports  more  than  one  data  file  or  compila¬ 
tion,  identify  Hie  file  and  compilations).  Dow  Chemical  Company  internal 
_ thermodynamic  data  file _ 
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G.  Flow  Chart  the  Step*  Involved  in  Pet*  Source  File  Construction  and  Maintenance 


(See  pages  30-31. ) 
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III.  DATA  FILE  CONSTRUCTION  *  (combined  Data  Source  &  Data  File) 


A.  Screening  and  Evaluation 

1.  What  criteria  are  used  to  select  data  for  inclusion  in  the  data  file? 
(x)  a.  None,  except  that  the  data  fall  within  the  scope  of  the  data 
file  and  are  from  an  apparently  reliable  source. 

(  )  b.  Data  which  meets  the  file's  criteria  for  substance  purity, 
measurement  reliability,  etc. 

(  )  c.  Critically  qualified  data  which  meets  the  "one  best  estimate" 
value  for  the  property. 

(  )  d.  Other  (specify) _ 


2.  When  is  data  screened  for  exclusion  from  the  data  file? 

(x)  a.  Prior  to  extraction  from  the  source  document. 

(  )  b.  Subsequent  to  extraction  from  the  source  document. 

3.  What  percentage  of  source  documents  screened  provide  data  usefu1  in 

data  file  construction?  >90% 


B.  File  Accumulation  and  Purging 

1.  What  is  the  earliest  published  date  covered  by  data  items  processed  to 

present?  1925  _ _ 

2.  What  is  the  total  number  of  data  sources  processed  to  date,  i.  e. ,  what  is 

the  total  number  of  documents  or  data  sources  from  which  data  have  been 
extracted?  25  0QQ _ 

3.  What  percentage  of  potentially  useful  data  sources  (see  item  II-A-S)  has 

been  screwed  for  processing  of  data  into  the  file? _ 100 % 

4  What  combinations  of  substances  and  properties  are  normally  found  in  sourc  ? 
documents  ? 

70%  Single  substance  90%  Single  property 

30%  Multiple  substances  10%  Multiple  properties 

5.  What  is  the  current  annual  rate  of  processing  data  into  the  file? 

1,  5QQ _ additional  source  items  included 

1,  000  apprx. additional  substances  included 
0 _ additional  properties  included 

_ NA  _ new  values  (i.  e.  ,  for  substances  and/or  properties  not  pre¬ 
viously  included) 

Na _  additional  or  updated  values  for  substa.  ces  and  properties 

previously  included 
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6.  What  percentage  of  data  extracted  is  in  each  of  the  following  forms? 

85  %  Digital  or  alphanumeri  c  5  %  Formula  *  (equations) 

10  %  Graphic  _ %  Other  (identity) 


7.  What  is  the  frequency  of  updating  of  the  data  file? 
(x)  a.  Continuously 

(  )  b.  At  intervals  (describe  interval  and  basis) 


8-  How  are  data  files  purged? 

(  )  a.  Files  are  not  purged.  ^to  data  source  file) 

(x)  b.  Property  values  are  retired  when  more  reliable  v  iues  become 
available. 

(  )  c.  Files  are  retired  when  a  data  compilation  is  published. 

(  )  d.  Other  (identify) _ 


C.  Chemical  Substance  Coverage 

1.  What  is  the  intended  substance  coverage  for  this  file?  All  chemical 
compounds  of  interest  to  the  Dow  Chemical  Company _ 


2.  What  is  the  total  number,  to  date,  of  chemical  substances  included  in  this 

data  file  ?  agPrgg,..  JjLPOj) - • 

3.  What  is  the  distribution  of  coverage  among  the  following  groups  of  chemical 
substances? 

<5  %  Chemical  Elements 

_ %  Single  Chemical  Elements 

_ %  Elemental  Mixtures  &  Systems  (including  Alloys) 

apprx.  47.  5  %  Inorganic  Compounds,  Mixtures  A  Systems 
_ %  Inorganic  Compounds 

_ %  Inorganic  Mixtures  &  Systems  (including  Solutions) 

apprx.  47.  5  %  Organic  Compounds,  Mixtures  it  Systems 
_ %  Organic  Compounds 

_ %  Organic  Mixtures  &  Systems  (including  Solutions) 

_ 0_%  Genericaliy  Identified  Substances 

_ %  Polymers,  Rubbers,  Etc, 

_ %  Systems,  Composites 

_ _ %  Natural  Products 

_ %  Minerals 

_ %  Refractories 

_ %  Other _ _ _ 

_ %  Other _ _ _ 

_ %  Other _ _ 

_ %  Other _ 

%  Other 


Solano*  Communication 

Washington,  D.  C. 

Final  Report/ 3 -20-67 


4.  Identify  the  different  phases,  forms,  concentrations  etc. ,  of  substances 
included  in  the  data  file.  Solid,  liquid,  gaseous _ 


S.  If  this  effectively  increases  the  number  of  substances  covered  in  the  file, 
indicate  the  percentage  increase  over  item  C-2.  _ 100  % 
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D.  Property  Coverage 

1.  What  is  the  intended  property  coverage  for  this  file?  Thermocbemicnl 
and  related  fundamental  properties _ 


2.  What  is  the  total  number,  to  date,  of  properties  included  in  this  data  file? 

_ 70 _ . 

3.  How  many  new  properties  are  annually  added  to  the  file?  _ 0 _ . 

4.  What  is  the  average  number  of  values  per  property- substance  included  in 
the  file?  N.  A. 

5.  What  factors  determine  the  number  of  values  compiled  for  each  property? 

The  number  of  reputable  values  which  can  be  located  in  the  litera- _ 

ture  or  calculated  from  basic  data _ 


E.  If  this  data  file  supports  more  than  one  data  compilation,  identity  the  compil 
ations .  JANAF Thermochemical  Tables.  Dow  Thermochemical  Data  Files 


F.  File  Formatting 

1.  is  the  source  document  format  retained  or  is  the  format  converted  to  a 
standardized  format?  (x)  Format  retained,  (x  )  Format  converted. 

90%  10% 

2.  What  is  the  current  volume  of  the  file?  8,  000  File  Units. 

N.A.  Data  Records. 

3.  Use  the  following  table  to  indicate  the  manner  of  recording  information  in 
the  data  file  or  files. 

This  file  is  an  unstructured  data  source  and  working  file  which  contains 
source  documents,  extracts,  computations  and  JANAF  data  sheets  for 
the  substance.  The  primary  entry  point  is  chemical  substance. 
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Length  of 

Element 

(Characters) 

— • 

j 

Recording 

Media 

1 

!|§$s| 

B9KS 
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mmm 

BUI 

|HI 

Type  of 
Represen¬ 
tation 

Number/ 

File  (pages) 

O 

_a 

No.  of  Line 
Entries  or 
File  Unit 

UNSTRUCTU 

-  — 

n- — 

Type  of 
Value 

♦  FILE  li 

Sequence  of 
Properties 
(File  Level) 

I 

File  Element 
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G*  Flow  Chart  the  Step#  Involved  in  the  Construction  and  Maintenance  of  the 
Data  File. 


(See  pages  30-31. ) 
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III.  DATA  FILE  CONSTRUCTION  *  (Slave  File) 


A.  Screening  and  Evaluation 

1,  What  criteria  are  used  to  select  data  for  inclusion  in  the  data  file? 

(  )  a.  None,  except  that  the  data  fall  within  the  scope  of  the  data 
file  and  are  from  an  apparently  reliable  source. 

(  )  b.  Data  which  meets  the  file's  criteria  for  substance  purity, 
measurement  reliability,  etc. 

(x)  c.  Critically  qualified  data  which  meets  the  "one  best  estimate'* 
value  for  the  property. 

(  )  d.  Other  (specify) _ 


2.  When  is  data  screened  for  exclusion  from  the  data  file? 

(  )  a.  Prior  to  extraction  from  the  source  document. 

{  x)  b.  Subsequent  to  extraction  from  the  source  document. 

3.  What  percentage  of  source  documents  screened  provide  data  useful  in 

data  file  construction?  _ 10% 

B.  File  Accumulation  and  Purging 

1 .  What  is  the  earliest  published  date  covered  by  data  items  processed  to 

present?  1925 _ 

2.  What  is  the  total  number  of  data  sources  processed  to  date,  i.  e. .  what  is 

the  total  number  of  documents  or  data  sources  from  which  data  have  been 
extracted?  2500 _ 

3.  What  percentage  of  potentially  useful  data  sources  (see  item  II-A-6)  has 

been  screened  for  processing  of  data  into  the  file?  _ >95  % 

4.  What  combinations  of  substances  and  properties  are  normally  found  in  source 
documents  ? 

80%  Single  substance  80%  Single  property 

20%  Multiple  substances  20%  Multiple  properties 

5.  What  is  the  current  annual  rate  of  processing  data  into  the  file? 

_ 150  additional  source  items  included 

_ 100  additional  substances  included 

_ 0  additional  properties  included 

_ N.  A.  new  values  (i.  e.  ,  for  properties  not  pre¬ 
viously  included) 

_ N.  A,  *  additional  or  updated  values  for  substances  and  properties 

previously  included. 

^properties  updated  for  100  substances 


Washington,  o.  c. 

Final  Report/ 3-20-67 


NSF  C-478 
-D-xiii- 


6.  What  percentage  of  data  extracted  is  in  each  of  the  following  forms? 
85  %  Digital  or  alphanumeric  5  %  Formula 

10  %  Graphic  _ %  Other  (identify) _ 


7,  What  is  the  frequency  of  updating  of  the  data  file? 

(  )  a.  Continuously 

<x)  b.  At  intervals  (describe  interval  and  basis)  Quarterly 


8.  How  are  data  files  purged? 

(  )  a.  Files  are  not  purged. 

(x )  b.  Property  values  are  purged  when  more  reliable  values  become 
available. 

{  )  c.  Files  are  retired  when  a  data  compilation  is  published. 

(  )  a.  Other  (identify) _ _ 


I 

i 

I 

i 

I 

1 

I 


C.  Chemical  Substance  Coverage 


1.  What  is  the  intended  substance  coverage  for  this  file?  Light  metal 
compounds.  iaclu4iflg.h.Y<jn^S-..  nitrides,  Jfealideg,  etc. _ 


2.  What  is  the  total  number,  to  date,  of  chemical  substances  included  in  this 

data  file?  625 _ . 

3.  What  is  the  distribution  of  coverage  among  the  following  groups  of  chemical 
substances  ? 

_ 20  %  Chemical  Elements 

%  Single  Chemical  Elements 

_ %  Elemental  Mixtures  A  Systems  (including  Alloys) 

_ 80  %  Inorganic  Compounds,  Mixtures  A  Systems 

_ %  Inorganic  Compounds 

_ %  Inorganic  Mixtures  A  Systems  (including  Solutions) 

%  Organic  Compounds,  Mixtures  A  Systems 
_ %  Organic  Compounds 

_ %  Organic  Mixtures  A  Systems  (including  Solutions) 

_ %  Genericaily  Identified  Substances 

_ %  Polymers,  Rubbers,  Etc. 

_ %  Systems,  Composites 

_ %  Natural  Products 

_ %  Minerals 

_ %  Refractories 

_ "h  Other _ _ 

_ %  Other _ _ 

_ %  Other  ' 

_ %  Other  __^IIZZZIZIZZZZIIZZZZZI 

%  Other 
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I 

I 

I 

I 

I 

I 

I 

I 
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4.  Identify  the  different  phases,  forma,  concentrations,  etc. ,  of  substances 
included  in  the  data  file.  Sim£l2Si^LSS^SSm.  69Ud  _ 


5.  If  this  effectively  increases  the  number  of  substances  covered  in  the  fibs, 
indicate  the  percentage  increase  over  item  C-2.  _ ggjfc 


D.  Property  Coverage 

1.  What  is  the  intended  property  coverage  for  this  file?  ffasic 
dynamic  and  specrographic  data  required  to  compute  <1ANAF  tables. 


2.  What  is  the  total  number,  to  date,  of  properties  included  in  this  data  file? 

20 _ . 

3.  How  many  new  properties  are  annually  added  to  the  file? _ g _ . 

4.  What  is  the  average  number  of  values  per  property- substance  included  in 

the  file?  1 _ . 

5.  What  factors  determine  the  number  of  values  compiled  for  each  property? 

number  required  to  compute  thermodynamic  fimetiVmg _ 


E.  If  this  data  file  supports  more  than  one  data  compilation,  identify  the  compil¬ 
ations.  _ Nf  A, _ 


F.  File  Formatting 

1 .  Is  the  source  document  format  retained  or  is  the  format  converted  to  a 
standardized  format?  (  )  Format  retained,  (x)  Format  converted. 

2.  What  is  the  current  volume  of  the  file?  950  File  Units. 

Data  Records. 


3.  Use  the  following  table  to  indicate  the  manner  of  recording  information  in 
the  data  file  or  files. 
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G.  Flow  Chart  the  Steps  involved  in  the  Construction  and  Maintenance  of  the 
Data  File. 


(See  pages  30-31.) 
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USER  SERVICING 

A.  Service  Philosophies  and  Emphases 

1.  How  are  the  following  service  features  rated  in  your  current  operations?  Use 
the  following  scale,  or  comment: 

P  *  A  primary  objective 
S  *  A  secondary  objective 

O  *  Desirable,  but  subordinated  by  other  factors 
N  *  Not  a  pertinent  objective 
U  *  Undesirable  objective 

(N)  a.  Collect  and  disseminate  all  data  in  area  covered. 

(P)  b.  Collect  all  data  and  disseminate  only  selected  data  in  area  covered. 

(N)  c.  Collect  and  disseminate  only  selected  data  (e.  g. ,  most  probable 
values)  in  area  covered. 

(S )  d.  Provide  user  with  references  to  sources  of  data. 

(N)  e.  Provide  user  with  data  usable  and  with  reference  to  information  sources. 

(P)  f .  Provide  user  with  data  usable  without  reference  to  information  sources. 

(D)  g.  Provision  of  quick  response  to  individual  queries. 

(P)  h.  Periodic  dissemination  of  accumulated  information  or  data. 

(D)  i .  Fast  processing  of  newly  available  data. 

(S )  j  .  Fast  dissemination  of  newly  available  data. 

(P)  k.  Reduce  costs  for  user  to  obtained  references  and/or  data. 

{S )  1 .  Increase  utility  of  data  by  conversion  to  a  more  useful  format  (e.  g. , 
graphical  presentations,  or  ADP  forms  such  as  punched  cards. 

2.  Who  are  the  users  of  the  products  of  this  data  compilation  activity? 

(x )  a.  Basic  Scientists _ 25  % 

(x  )  b.  Applied  Technologists _ 75% 

(  )  c.  Others _ %  (Specify)^ _ 

(x  )  d.  Specific  technical  communities  (Please  describe:  if  their  needs  have  / 

influenced  your  service  criteria,  please  comment:) _ 

_ rocket  .Brapulgtan  specialists _ 


(  )  e.  Can  you  characterize  the  users  more  definitively,  e.  g. ,  how  they  use 
the  data,  the  estimated  (1)  active,  and  (2)  potential  size  of  the 
community  or  communities,  etc?  (for  each  compilation  prepared): 
The  JANAF  Thermochemical  Tables  serve  as  the  authority  for 

thermnchemical  data  for  uae  in  rocket  propellant  performance 

_ calculations.  The  Tables  have  been  distributed  internationally 

_ and  serve  a  broad  spectrum  of  basic  scientists  and  applied _ 

_ technologists. - - 
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Activity’s  current  processing  time  from  data  source  identification  to  user 
availability. 


a. 


N,  A. 


b. 

c. 

d. 


N.  A. 


N.  A. 


Averages  6  months 


for  bibliographic  referencing  in  response  to 
query. 

for  processed  data  in  response  to  an  inquiry, 
for  appearance  in  accession  lists  or  bibliographies, 
for  distribution  in  data  compilation  (specify  format) 


B.  Types  of  Output  Prepared 

1.  Indicate  the  distribution  of  volume  of  output: 

_ %  Data  source  reference  without  data  extracts. 

%  Data  extracts  without  data  source  references. 

156%  Data  extracts  and  data  source  references. 

%  Other  (identify) _ _ 

2.  Indicate  the  distribution  of  output  according  to  the  type  of  requirement: 
_ %  Generated  in  response  to  inquiries. 

100%  Generated  for  periodic  publication. 

_ %  Other  (Specify) _ 


C.  Usage  of  Data  Source  Files  (Library) 

1. 

i. 

i 

s 


i 


I 

i 


Indicate  the  frequency  of  usage  of  the  data  source  file  (library). 


Type  of  File  Search 

Number  of 
Searches /Year 

Searches  in  response  to  external  queries  for  document 
references. 

None 

Periodic  searches  for  preparation  or  updating  of  a  bib¬ 
liography  or  accession  list  for  external  distribution. 

None 

Searches  by  staff  to  identify  sources  of  data  required 
to  answer  data  queries. 

None 

Searches  by  staff  to  identify  sources  of  data  required 
to  prepare  or  update  data  compilations 

file  folders. 
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2,  Provide  the  following  information  for  accession  lists  or  bibliographies 
prepared  and  disseminated.  NONE  PUBLISHED. 


Accession  Lisx 
or  Bibliography 

Frequency  of 
Publication 

Forms  Distributed 

Type  of  Indexing 

 J 

or  Updating 

3.  What  criteria  are  used  to  select  data  sources  for  inclusion  in  bibliographies? 
NONE  PUBLI 


4.  Is  the  format  of  the  data  source  file  (library)  normally  retained  or  modified 
for  bibliography  publication?  N.A. 

(  )  a.  Retained  (  )  b.  Modified 


D.  Usage  of  Data  File 


Type  of  File  Search 


Searches  to  obtain  data  required  to 
answer  the  data  queries. 

Searches  to  obtain  data  required  to 
prepare  or  update  data  compilations 


Number  of 
Searches/Year 


of  a  computation 
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Provide  the  Following  Information  for  Each  Compilation  Where  the  Information  is 

Different  From  the  Data  File 


Identity  of  Compilation  JANAF  Thermochemical  Tables 


F.  Screening  and  Evaluation 

1.  What  criteria  are  used  to  select  data  for  inclusion  in  the  compilation? 

(  )  a.  None,  except  that  the  data  fall  within  the  coverage  of  the  data 
compilation  and  are  from  an  apparently  reliable  source. 

(  )  b.  Data  which  meet  the  compilation's  criteria  for  substance  purify, 
measurement  reliability,  etc. 

(x)  c.  Critically  qualified  "one  beat  value". 

(  )  d.  Other  (identify) _ 


2.  What  percentage  of  data  pulled  from  the  data  file  is  normally  accepted 

for  inclusion  in  the  data  compilation?  iQQ  %  *(part  of  table  computation) 

3.  What  percentage  of  potentially  useful  data  items  in  the  data  file  have 

been  considered  for  inclusion  in  the  data  compilation?  >95  % 

4.  What  is  the  current  annual  rate  of  issuing  data  in  compilations? 

150 _ additional  source  items  included 

100 _ additional  substances  included 

_ 0 _ additional  properties  included 

_ 0 _ new  values  (i.  e. ,  for  substances  and/or  properties  not  pre¬ 
viously  included) 

100 _ additional  or  updated  values  for  substances  and  properties 

previously  included 

_ other  additions  or  changes  (please  identify) 


G.  Chemical  Substance  Coverage 

1.  What  is  the  intended  substance  coverage  of  this  compilation?  Light 

elements  and  their  oxides,  fluorides,  chlorides,  nitrides,  carbides. 
hydrides,  etc. _ 
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2*  Whet  is  the  total  number,  to  date,  of  chemical  substances  included  in  thie 
*****  compilation?  62 S  (estimate) 

3.  What  ie  the  distribution  of  coverage  among  the  following  groups  of  chemical 
substances? 

_ 10  %  Chemical  Elements 

_ %  Single  Chemical  Elements 

_ %  Elemental  Mixtures  4  Systems  (including  Alloys) 

90%  Inorganic  Compounds,  Mixtures  A  Systems 
%  Inorganic  Compounds 

%  Inorganic  Mixtures  A  Systems  (including  Solutions) 

%  Organic  Compounds,  Mixtures  &  Systems 
%  Organic  Compounds 

_ %  Organic  Mixtures  4  Systems  (including  Solutions) 

%  Generically  Identified  Substances 

_ _ %  Polymers,  Rubbers,  Etc. 

^ _ %  Systems,  Composites 

_ Natural  Products 

%  Minerals 

_ %  Refractories 

_ %  Other _ _ 

_ %  Other _ 

_ %  Other _ _ 

%  Other 

_ %  Other 

4.  Identify  the  different  Phases,  forms,  concentrations,  etc.  of  substances 

included  in  the  data  file .  .  Itattid,  ggfiggltt _ 
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$.  If  this  effectively  increases  the  number  of  substances  in  III-G-2,  give 
the  percentage  increase  over  item  C-2.  _ 32  % 

H.  Proper^-  Coverage 

1.  What  is  the  intended  property  coverage  for  this  compilation?  Heal 

capacity,  entropy,  free  energy  function,  enthalpy,  heat  of  formation. 
equilibrium  constant _ 


2.  What  is  the  total  number,  to  date,  of  properties  included  in  this  compilation? 
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3.  What  is  the  average  number  of  value*  per  property ' substance  in*  laded  in  the 
compilation?  61 

4.  What  factors  determine  the  number  of  values  published  for  each  property? 

A  value  is  pubIighed  for  each  iOO°C  Temperatuge  interval  betyeen  _ 

0°K  and  6000** K;  when  a  substance  experiencesa  'phase  change  over,.  ~ 
this  temperature  range.  • 


I.  Compilation  Formatting 

1.  Is  the  data  file  format  retained  or  is  the  format  converted  to  a  different 
publication  or  dissemination  format? 

(  )  a.  Data  file  format  retained  (x  )  b.  Format  converted 

2.  What  is  the  current  volume  of  the  data  compilation? 

_ 9S0  data  sheets,  etc. 

_ information  units 

3.  Use  the  lollowing  table  to  indicate  the  manner  used  to  publish  or. 

disseminate  the  data  compilation.  - 
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Output  Document  Designation: _ JANAF  Thermochemical  data  (p.  4  of  5) 


Output  Document  resignation:  JANAF  Th.rmoc!i.m,c,i  Da: 


At  the  document  level,  >  basic  constants,  6  derived  constants,  4  defined  constants  and  8  auxilary  constants  are 
defined  in  terms  cf  name,  symbol,  value,  units,  and  uncertainty 


Construction  and  Maintenance  of  Data  Source  and  Data  i 


Scisineo  Communication 

iV&tthington.  O  C  200  07 


1  i-’ n-ial  Ifvpori 

.:<!  ;vi ai  oi>  i 


-D-xxx 


3ol«nce  Communication 

Washington.  D.  C.  200  07  NSF  C-478 

l-'mal  R.  pol  l  /  3  20-  67  -D-xxxii~ 

;\n  <  xt*  nBive,  iterative  computing  routine  iB  employed  to  compute  the  tables 
of  tii.  '  inoiiy mimic  functions.  The  input  consists  of  the  constants  and  funda- 
m.  tital  data  stirred  in  the  basic  data  magnetic  tape  file  and  a  stored  program 
to'  processing  this  data.  Preparation  of  a  detailed  flow  chart  of  the  compu- 
t at.ionaJ  procedure  does  not  fall  within  the  scope  of  our  current  effort. 

The  imal  tables  are  prepared  by  typing  and  photo-reproduction  from  the 
computer  printouts.  Recently,  a  successful  trial  was  made,  jointly  with  the 
National  Bureau  of  Standards,  to  automatically  compose  and  print  the  data 


sheets. 
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V.  GENERAL  INFORMATION 

A.  Describe  significant  historical  changes  in  subject  scope,  users  served, 
services  offered,  operational  mechanics,  revenue  sources,  etc. 

_ The  JANAF  Thermochemical  Tables  activity  has  achieved  a  stable  level 

_ of  operation.  Future  changes  are  expected  to  represent  refinement  of 

_ methods  and  introduction  of  improved  processing  techniques.  The  ob- 

_ iectives,  scope,  etc. .  of  the  compilation  activity  appear  stable  for  the 

near  future. _ ■ _ 


B.  Indicate  present  and  potential  opportunities  and  problems  related  to  recent 
or  tuture  operations  of  this  activity,  especially  as  it  relates  to  use  of  auto¬ 
matic  data  processing. 

_ 1)  Recently  a  prototype  supplement  to  the  JANAF  Thermochemical 

_ Tables  was  prepared  by  computer-aided  photocomposition.  This 

_  prototype  was  prepared  with  the  assistance  of  the  National  Bureau 

_ of  Standards  (NBS).  NBS  equipment  was  used, _ 

2)  The  Thermal  Laboratory  is  currently  shifting  from  maintaining - 

_ hard  copies  of  data  sources  to  microcard  copies. 


C.  Attach  statistical  information,  such  as  size  of  document  collection,  system 
usage  volume,  personnel,  budget  experience,  etc. 

Current  budget:  $200,  000  per  year 
Staff:  6  professionals 
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ANALYSIS  OF  DATA  COMPILATION  ACTIVITIES 


(TPRC )  1.  OPERATIONAL  CONTEXT 

(TPRC)  A.  Housing  Organization 

1.  Name:  Thennophysical  Properties  Research  Center,  Purdue 
University,  Lafayette,  Indiana.  (NSF  C-U78) 

2.  Type  of  institution:  Educational 

3-  Primary  motivation  for  data  compilation  activities: 

Primarily  for  in-house  support  servicing;  secondarily  for 
external  servicing,  non-profit. 

4.  Source  of  funding  for  data  compilation  activities:  100# 
federal  government. 

5.  Percentage  distribution  of  current  data  compilation  efforts 
among  the  following  processing  functions:  35*5#  for 
acquisition  of  data  sources,  source  document  screening  and 
evaluation,  and  source  document  file  maintenance;  57*5#  for 
data  extraction,  screening  and  evaluation  and  for  compilation 
publication  and  dissemination;  7#  for  inquiry  handling. 

(TFRC)  II.  DATA  SOURCE  (LIBRARY)  COLLECTION  AND  PROCESSING 

(TPRC)  A.  Identification  and  Acquisition  of  Sources  of  Data 

1.  The  following  are  screened  in  searching  for  candidate 
document  titles  or  data  sources:  65#  Journal  articles, 

4#  bibliographic  citations  in  indexed  documents,  30#  index 
and  abstract  publications,  1#  others. 

2.  Total  .lumber  of  candidate  sources  identified  annually: 
Approximately  9,200. 

3.  The  following  data  sources  are  annually  indexed:  7 6# 

Journal  articles,  24#  published  reports  other  than  Journal 
articles  (Journal  articles  may  be  held  in  the  form  of  an 
abstract  only,  until  original  document  can  be  obtained). 

4.  What  is  the  form  of  data  sources  currently  accessioned’/ 

80#  hardcopy,  20#  microfiche,  as  received. 

5.  What  is  the  total  number  of  data  sources  indexed  to  date? 
33,700  indexed,  40,000  accessioned 

(TPRC)  B.  Data  Source  Screening  and  Evaluation 

1.  v/hat  percentage  of  initially  considered  titles  is  rejected 
without  physical  examination  of  the  data  source?  Not 
applicable  -  abstracts  or  Journals  scanned  directly. 


via  re  ;■  1007 

-hint  Percentage  of  data  sources  physically  examined  is 
rejected/  505$  up  to  now,  to  go  down  with  new  accession* 
acreening  procedure  of  direct  Journal  scanning. 

(7‘iHC;  c.  Data  dourc.  Cataloging  and  Indexing 

s .  lridicute  the  extent  to  which  the  following  modeB  of 
cataloging  and  indexing  arc  used: 

75,116  substance,  7  property,  7  treatment  of  subject, 

9  substance  gtate,  and  9  language  of  article  descriptors 
are  used  to  date. 

0.  Rate  of  addition  of  substance  descriptors  is  iot 

available,  property,  subject  treatment,  substance  state, 
and  language  descriptors  are  fixed  in  number. 

c.  An  average  of  2  1/2  to  3  substance  and  property 
descriptors  assigned  to  each  source  document,  1  subject, 

1  substance  state  and  1  language  descriptor. 

d.  Approximately  12,650. 

2.  Indicate  which  means  are  used  to  index  the  substance  coverage 
of  data  sources:  C.A.  or  other  systematic  chemical 
nomenclature,  in  combination  with  organization  rules  based 
on  chemical  composition  and  classes  of  substances. 

(TPRC)  d.  purging  of  Data  Source  File  (Library) 

1.  How  many  data  sources  are  annually  purged  from  the  file? 

File  Is  not  purged. 

(TpRC)  III.  DATA  FILE  CONSTRUCTION 

(TPRC)  A.  Screening  and  Evaluation 

1.  What  criteria  are  used  00  select  data  for  inclusion  in 
the  data  file?  Hot  applicable. 

2.  What  percentage  of  source  documents  screened  provide  data 
useful  in  data  file  construction?  Not  applicable. 

(TFRC)  B.  File  Accumulation  and  Purging 

1.  What  is  the  total  number  of  data  sources  processed  to  date, 
i.e.,  what  is  the  total  number  of  documents  or  data  sources 
from  which  data  have  been  extracted?  Not  applicable. 

2.  What  is  the  current  annual  rate  of  processing  data  into 
the  file?  Not  applicable. 

3  What  percentage  of  data  extracted  is  in  each  of  the  following 
forms?  20#  digital  or  alphanumeric,  80#  graphic  data,  as 
presented  in  the  open  literature. 

(TPRC)  C.  Chemical  Substance  Coverage 

1.  What  is  the  total  number,  to  date,  of  chemical  substances 
included  in  this  data  file?  Not  applicable. 


C--KH  1'  inal  Report 
20  vlurcii  1907 


-E-iii- 


2.  What  is  the  distribution  of  coverage  among  the  following 
groups  of  chemical  substances?  Not  applicable. 

(TPHC)  D.  Property  Coverage 

1.  What  is  the  total  number,  to  date,  of  properties  included 
In  this  data  file?  Not  applicable. 

2.  What  is  the  average  number  of  values  per  property-substance 
Included  in  the  file?  Not  applicable. 


(T?RC)  E.  File  Volume 

1.  What  is  the  current  volume  of  the  file?  Not  applicable. 

(TFRC)  IV.  USER  SERVICING 

(TFRC)  A.  Service  Philosophies  and  Emphases 

1.  How  are  the  following  service  features  rated  in  your 

current  operations?  Use  the  following  scale,  or  canment: 

P  -  A  primary  objective 
3  -  A  secondary  objective 
D  -  Desirable,  but  subordinated  by  other 
faetora 

N  -  Not  a  pertinent  objective 
U  -  Undesirable  objective 

(P)  a.  Collect  all  date  and  disseminate  moderately  evaluated 
data  in  area  covered. 

(p)  b.  Collect  all  data  and  disseminate  highly  evaluated  or 
selected  data  in  area  covered. 

(N)  c.  Collect  end  disseminate  only  selected  data  (e.g., 
most  probable  values )  in  area  covered. 

(P)  d.  Provide  user  with  references  to  sources  of  data. 

(p)  e.  Provide  user  with  data  and  with  information  source 
references . 

(N)  f.  Provide  user  with  data  without  information  source 
references. 

(P)  g.  Provision  of  quick  response  to  individual  queries. 

(p)  h.  Periodic  dissemination  of  accumulated  Information 
or  data. 

(D)  1.  Fast  processing  of  newly  available  data. 

(D)  J.  Fast  dissemination  of  newly  available  data.* 

(P)  k.  Reduce  costs  for  user  to  obtain  references  and/or 
data. 

(P)  1.  Increase  utility  of  data  by  conversion  to  a  more 
useful  format  (e.g.,  graphical  presentations,  or 
ADP  forms  such  as  punched  cards. ) 

(TFRC)  B.  Types  of  Output  Prepared 

1.  Indicate  the  distribution  of  output  according  to  the  type 
of  requirement :  Not  applicable. 

(TPRC)  C.  Usage  of  Data  Source  Files  (Library) 

1.  Indicate  the  frequency  of  usage  of  the  data  source  file 
(library):  Daily  manual  searches  by  staff  in  response  to 
external  queries  for  document  references  or  to  identify 
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aources  of  data  required  to  answer  data  queries;  Periodic 
a ear che s  every  12  to  18  months  for  preparation  or  updating 
of  a  bibliography  or  accession  list  for  external  distri¬ 
bution  (Retrieval  Guide);  continuous  searches  by  staff  to 
identify  sources  of  data  required  to  prepare  or  update 
data  compilations. 

(TPRC)  D,  Usage  of  Data  File 

Not  applicable. 

(TPRC )  E.  Publication  Statistics 

a.  TPRC  Data  Book,  i960,  3,322,  semi-annual,  11  x  17  Inch 
looseleaf  pages,  approximately  400  sets. 

b.  Standard  Reference  Data  monograph  publications  •  not 
applicable. 

(TPRC)  F.  Screening  and  Evaluation 

TPRC  Data  Book 

1.  What  criteria  are  used  to  select  data  for  inclusion  in  the 
compilation?  None,  except  that  the  data  fall  within  the 
coverage  of  the  data  compilation  and  are  from  a  reliable 
source;  critically  qualified  "recommended  values". 

2.  What  percentage  of  date  pulled  from  the  data  file  is 
normally  accepted  for  inclusion  in  the  data  compilation? 

Not  applicable. 

3.  What  is  the  current  annual  rate  of  issuing  data  In  compilations? 
1,391  additional  source  items  Included,  3,725  additional 
specimens  Included,  0  additional  properties  included, 

1086  pages  on  new  substances  and/or  properties  Included, 
and  642  additional  or  updated  pages  for  substances  and 
properties  previously  covered. 

(TPRC)  G.  Chemical  Substance  Coverage 

1.  What  is  the  total  number,  to  date,  of  chemical  substances 
Included  in  this  data  compilation?  7,700  specimens, 
substances  not  applicable. 

2.  What  is  the  distribution  of  coverage  among  the  following 
groups  of  chemical  substances?  Not  available. 

(TPRC)  K.  Property  Coverage 

1.  What  is  the  total  number,  to  date,  of. properties  Included 
In  this  compilation?  5. 

2.  What  is  the  average  number  of  values  per  property- 
substance  Included  In  the  compilation?  Not  applicable. 

(TPRC)  I,  Compilation  Formas 

1.  What  is  the  current  volume  of  the  data  compilation? 

3,322  data  a  sets. 
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Date: 

Contact: 


Purpose: 


Date: 

Contact: 


Purpose: 


Date: 

Contact: 

Purpose: 


Date: 

Contact: 

Purpose: 

Date: 

Contact: 


Project  Meetings  and  Contacts 


July  15 

Dr.  Fran*  Alt,  Dr,  Herman  M.  Weieman,  and  Dr.  Alfred  Weis  berg, 
National  Standard  Reference  Data  System,  National  Bureau  of  Stan¬ 
dards,  Washington,  D.  C. 

Mutual  project  orientation,  acquire  documents,  explore  potentials 
for  use  of  survey  data  from  usage  questionnaires  sent  to  ACS 
mailing  list. 

July  20 

Dr.  Guy  Waddington,  Director,  Office  of  International  Critical 
Tables,  National  Academy  of  Sciences  -  National  Research  Council, 
Washington,  D.  C. 

To  obtain  guidance  as  to  existing  chemical  data  compilation  acti¬ 
vities  and  to  arrange  use  of  the  library  of  the  Office  of  Interna¬ 
tional  Critical  Tables. 

July  27 

Miss  Marion  Woodruff,  Administrative  Assistant  to  Dr.  Frank 
Carman,  Technical  Director,  Manufacturing  Chemists  Association. 
Acquire  information  re  current  status  of  MCA  and  data  compila¬ 
tion  projects. 

July  28 

Dr.  Y.  S.  Touloukian,  Director,  Thennophynical  Properties  Re¬ 
search  Center,  Purdue  University,  Lafayette,  Indiana. 

Test  draft  version  of  Interview  guide  for  obtaining  information 
from  directors  of  data  compilation  activities. 

July  29 

Mr.  Harold  Rienstra,  Information  Specialist,  Systems  Develop¬ 
ment  Corporation,  Dayton,  Ohio. 
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Contact:  Mr.  Kenneth  Zabriskie,  Chemical  Abstract  Service®,  Columbus, 
Ohio. 

Purpose:  To  coordinate  our  project  effort  with  the  effort  conducted  by  the 
Chemical  Abstract  Services  under  NSF  support.  Primary  topics 
discussed  inducted  the  registry  system  and  its  potential  inter¬ 
face  with  systems  for  handling  data  on  the  properties  of  chemical 
substances  and  systems. 

Date:  August  13 

Contact:  Dr.  Herman  Weisman,  Office  of  Information  Services,  Standard 
Reference  Data  System,  National  Bureau  of  Standards,  Washing¬ 
ton,  D.  C. 

Purpose:  Obtain  preliminary  results  of  survey  of  technical  data  usage  by 
members  of  the  American  Chemical  Society  and  to  arrange  for 
use  of  the  NSRDS  library. 

Date:  September  13 

Contact:  Mr.  Daniel  N.  Lapedee,  Managing  Editor,  Encyclopedia  of  Science 

and  Technology,  McGraw-Hill,  Inc. 

Purpose:  Discuss  data  format  concepts,  topical  organization  experience  in 
encyclopedia  development. 

Date:  September  12-14 

Contact:  American  Chemical  Society  Meeting,  New  York,  New  York. 

Purpose:  Attend  sessions  concerning  chemical  data. 


Date: 

Contact: 


September  14 

Dr.  Howard  Bond,  Department  of  Chemistry,  University  of  Rhode 
Island. 
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Purpose:  Secure  advice  on  data  compilation  activities  in  toxicology  field 
that  are  desirable  candidates  for  the  project  analysis. 

Date.  September  15 

Contact:  Dr.  Bruno  J.  Zwolinski,  Chemical  Thermodynamic  Properties 
Center,  Texas  A  &  M,  College  Station,  Texes. 

Purpose:  Test  interview  questionnaire  draft. 

Date:  October  3 

Contact:  Mr.  Emil  Schafer,  Director,  and  Mr,  John  Milek,  Staff  Menber, 
Electronic  Prope.'tiej  Information  Center,  Hughes  Aircraft  Com¬ 
pany,  Culver  City,  California. 

Purpose:  Acquire  specimen  ^Materials. 

Date:  October  4~6 

Contact:  American  Documentation  Institute  Meeting,  Santa  Monica,  California. 

Purpose:  Attend  sessions  t  elated  to  pr  oject  objectives. 

Date:  October  7 

Contact:  Mr.  John  Milek,  Electronic  Properties  Information  Center, 

Hughes  Aircraft  Company,  Culver  City,  California. 

Purpose:  Discuss  data  format  concepts,  acquire  specimen  materials. 

Date:  October  10 

Contact:  Dr.  It.  E.  Hultgren,  Department  of  Metallurgy,  University  of 
California,  Berkeley,  California;  Dr.  K.  K.  Kelley,  Retired 
(Bureau  of  Mines,  Department  of  Metallurgy),  University  of  Cali¬ 
fornia,  Berkeley,  California;  and  Mr.  Brewer,  Department  of 
Chemistry,  University  of  California,  Berkeley,  California. 

Purpose-.  Acquire  specimen  materials. 

Date:  December  14 

Contact:  Dr.  S.  Rossmassler,  NSRDS,  National  Bureau  of  Standards. 


Gaithersburg,  Maryland 
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Purpose:  Discussion  and  exploration  of  specific  data  center  activities. 

Date:  December  28 

Contact:  Dr.  L.  E.  Kuentzel,  Wyandotte  Chemicals  Corporation,  Wyandotte, 

Michigan. 

Purpose:  Acquire  specimen  Materials  and  discussion  of  activities. 

Date:  December  29 

Contact:  Dr.  Roger  Simard,  Manager.  X-Ray  Powder  Diffraction  Data  Sys¬ 

tem,  ASTM,  1916  Race  Street,  Philadelphia,  Pennsylvania. 
Purpose:  Acquire  specimen  materials  and  discussion  of  activities. 


Date:  January  17 

Contact:  Dr.  V.  J.  Johnson,  Chief,  Cryogenic  Data  Center,  National  Bu¬ 
reau  of  Standards,  Boulder,  Colorado. 

Purpose:  Acquire  specimen  materials  and  discussion  of  activities. 

Date:  January  18 

Contact:  Dr.  Ralph  Hultgren,  Department  of  Metallurgy,  University  of 
California,  Berkeley,  California. 

Purpo  se:  Acquire  specimen  materials  and  discussion  of  activities. 

Date:  January  20 

Contact:  Mr.  Emil  Schafer,  Chief,  Electronic  Properties  Information  Center, 
Hughes  Aircraft  Company,  Culver  City,  California. 

Purpose:  Acquire  specimen  materials  and  discussion  of  activities. 

Date:  January  23 

Contact:  Dr.  C.  F.  Barnett,  Chief,  Atomic  and  Molecular  Processes  Data 
Center,  Oak  Ridge  National  Laboratories,  Oak  Ridge,  Tennessee. 

Purpose:  Acquire  specimen  materials  and  discussion  of  activities. 

Date:  January  24-25 

Contact:  Dr.  Bruno  J.  Zwolinski,  Chemical  The rmodynam'  -:  Properties  Cen¬ 
ter,  Texas  A  &  M,  College  Station,  Texas. 
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Washington,  O.  C.  200  07 
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Purpose:  Acquire  specimen  materials  and  discussion  of  activities. 

Date:  January  31 

Contact:  National  Science  Foundation  Symposium  on  the  Chemical  Abstract 
Services,  Washington,  D.  C. 

Purpos  :  Attend  conference  for  additional  information  on  related  aspects  of 
project  objectives. 

Date:  February  2 

Contact:  Mr.  D.  Wagman,  Chemical  Thermodynamic  Data  Group,  National 
Bureau  of  Standards,  Gaithersburg,  Maryland. 

Purpose:  Acquire  specimen  materials  and  discussion  of  activities. 

Date:  February  2 

Contact:  Dr.  D.  Stull,  JANAF,  Dow  Chemical  Company,  Midland,  Michigan. 

Purpose:  Acquire  specimen  materials  and  discussion  oi  activities. 

Date:  February  13 

Contact:  Mr.  D.  Wagman.  Chemical  Thermodynamic  Data  Group,  National 
Bureau  of  Standards.  Gaithersburg,  Maryland. 

Purpose:  Final  discussion  of  activities. 

Date:  February  14 

Contact:  Dr.  G.  Waddington,  Director,  Office  of  International  Critical 

Tables,  National  Academy  of  Sciences  -  National  Research  Council, 
Washington,  D.  C. 

Purpose:  To  obtain  further  perspective  on  critical  data  activities  being  inves¬ 
tigated. 

Date:  March  6 

Contact:  Dr.  Y.  S.  Touloukian,  Director,  and  associates.  Thermophysical 

Properties  Research  Center,  Purdue  University,  Lafayette,  Indiana. 

Purpose:  To  finalize  investigation  of  activity. 


